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A COMPARISON OF THE ATTITUDES AND INTERESTS OF 
NINTH-GRADE STUDENTS OVER TWO DECADES 


MARY COVER JONES 


Department of Education and Institute of Human Development, 
University of California, Berkeley 


This paper compares the interests, ac- 
tivities and opinions of a sample of ninth 
graders in 1935 with those of ninth graders 
in the same junior high school in 1953* and 
again in 1959. The 1935 sample consists of 
members of the Oakland Growth Study 
(1932-1939). As adults, this group is now 
cooperating in a follow-up study in which 
records are being obtained of their con- 
temporary interests and attitudes and 
their estimate of how they believe they an- 
swered similar items when they were ado- 
lescents. To evaluate the consistency of the 
records, it seems important to consider 
trends which may have occurred as a result 
of social changes. Tests of new groups of 
the same age as the adolescents in our orig- 
inal sample have provided the basis for 
this comparison survey. 

Are teenagers today more serious or 
more frivolous, more conforming or more 
rebelling, more sophisticated or more 
naive, than teenagers of a generation ago? 
This is not a question which can have a 
general answer, since trends (if they oc- 
cur) will not necessarily be in the same di- 
rection in different regions or in different 
social classes. However, where relevant 


*The comparison of 1935 with 1953 data 
is more extensive, including sampling de- 
scriptions on the basis of census tract analy- 
sis of residence, intelligence test scores and 
ratings of parents’ occupation. The 1959 test- 
ing was undertaken as a supplementary 
checking device for our 1953 interpretations 
and to bring the study up to date. 
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data are available for known samples it 
seems desirable to report them and con- 
sider their implications in relation to other 
research in this field. What adolescents 
think, do, want is of interest from many 
different theoretical and practical points of 
view. 


COMPARISON OF SAMPLES 


In a state growing as rapidly as Califor- 
nia, the population composition of some 
neighborhoods has changed so greatly that 
temporal trends would be difficult to dis- 
entangle from differences in social groups. 
Fortunately for our purpose, the neighbor- 
hoods involved in this study have to a 
fairly high degree maintained the stability 
which was noted at the time this junior high 
school was chosen for the longitudinal 
study. 

Table 1 compares the distribution of par- 
ents’ occupations (Edwards, 1933) in 1935 
and 1953, and Table 2 the census tract clas- 
sifications for home addresses of our two 
samples, based on Tryon’s cluster analysis 
of the 1940 and 1950 census tract data for 
this area. An increase has occurred in pro- 
fessional and managerial occupations, and 
the total for white collar occupations has 
increased from 59 to 65%. Similarly, a 
slight upward shift has occurred in the 
neighborhood classifications; here the prin- 
cipal change has resulted from the opening 
of a neighborhood of new homes, and the 
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TABLE 1 


SocrorconoMic CLASSIFICATION: PARENTS’ 
OccuPpaTION (Epwarps 6-Point ScaLeE) 
(Expressed in percentages) 

















1935 1953 
1. Professional 07 17 
2. Managerial 18 29 
3. Semiprofessional 38 25 
4. Skilled 29 25 
5. Semiskilled 06 04 
6. Unskilled 02 00 
Median 3.12 2.59 

SD 1.04 1.12 

TABLE 2 
Census Tract CLASSIFICATION* 
(Expressed in percentages) 

| FAS 1935 1953 

1. Exclusive 768 | 21 25.6 
2. Good neighborhood | 666 | 38 41 
3. New good neighbor- | 576 | 00.6 | 14 

hood 

4. Neighborhood of 466 | 29 17 


modest homes 


® Tryon classification (8 is high): 
F = Family life, i.e., detached owner occupied home 
A = Assimilation, i.e., native white 
8 Socioeconomic independence, i.e., managerial, 
professional males 
> The four classifications listed account for the ma- 
jority of the subjects in this study. 





diminished proportion remaining in an 
older neighborhood? 

To some extent these changes reflect the 
general upward mobility in living stand- 
ards, income and education which has oc- 
curred over this 18-year period. An addi- 


? Tryon has compared eight social areas in 
his study of 1940 and 1950 census tract data 
in the San Francisco Bay Area, and has noted 
that “...in the year 1940, people who lived 
in these social areas differed greatly from 
each other.” The differences (between these 
areas) in 1940 and in 1950 remain almost ex- 
actly the same (Tryon, 1955, 1959). In a per- 
sonal communication Tryon has affirmed 
that the Oakland section from which our 
sample was drawn has remained highly stable 
by his method of analysis. 


tional contributing factor was a change in 
the elementary school sources of this junior 
high school population. Before 1940 it drew 
from five elementary schools in the north- 
ern part of Oakland. By 1953 a new ele- 
mentary school (mean occupational rating 
2.59) had been added, and one school with 
an occupational rating of 3.9 had been dis- 
placed, its pupils being sent to a different 
junior high school. This change is in the 
direction of a more homogeneous middle 
class group, slightly above the former aver- 
age. 

If these changes had resulted in a very 
marked alteration of the social structure 
of the junior high school district, we would 
expect to find significant changes in the in- 
telligence distribution. This has not oc- 
curred as is shown in Table 3. 

As a further control on the comparabil- 
ity of the two populations in socioeco- 
nomic status, a subsample was selected of 
40 cases for each sex and each period under 
study (1935 and 1953), carefully matched 
on the basis of parents’ occupation. The 
responses for the subsamples on a 139 item 
comparison were in general agreement with 
those of the larger sample. A selection of 
these items, Things to be—Occupations is 
shown in Table 4. Because of the advan- 
tages of dealing with a larger sample, it 
was decided to use the total ninth grade 
population as the basis of this report. 


TABLE 3 
INTELLIGENCE Test (IQ) CLASSIFICATION 
(Expressed in percentages) 





1935* 1953> 
130-139 02 01 
120-129 06 12 
110-119 27 30 
100-109 38 30 
90-99 21 18 
80-89 05 07 
70-79 01 02 
Mean 105.49 105.18 
SD 10.75 12.28 


® Terman Group Test, Form A and B. 
> Kuhlmann-Anderson. 
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SUBJECTS AND PROCEDURES 


Results are discussed below for an in- 
ventory consisting of some 250 items per- 
taining to activities, interests, and atti- 
tudes. Levels of significance (Walker, 1953) 
have been computed for the sexes sepa- 
rately in a comparison of earlier with later 
records. Sex differences within each tem- 
poral sample have been analyzed, as well 
as age trends over the seven-year period in 
the earlier, longitudinal sample. 

Members of the Oakland Growth study, 
72 boys and 70 girls, are represented in the 
1935 testing, when they were in the ninth 
grade. For this group an interest record 
(Jones, 1940) was administered yearly for 
seven years, an activity schedule yearly 
from the 9th through the 12th grade, and 
an opinion ballot in the 8th, 9th, and 11th 
grades. 

For the 1953 comparison, all ninth-grade 
students in the same junior high school 
were tested. The boys numbered 95 and 
the girls 78. The additional test program 
in 1959 included 134 boys and 123 girls. 
Nonwhite students, now numbering about 
20, were eliminated to maintain compara- 
bility with the earlier samples. 

The test used in 1953 and in 1959 in- 
cluded a representative selection of items 
from the three inventories used in the lon- 
gitudinal study. It was administered in one 
class period of 50 min. Students were asked 
to check items organized on separate pages 
under such headings as Where I Went Last 
Week, and What I Read Last Week. They 
were asked to indicate whether they liked, 
disliked, or were indifferent to a number of 
items listed under titles such as Things to 
Own, or Things to Be: Occupations. Head- 
ings such as Things You Talk About were 
answered on a 5-point scale from Never 
to Very often.® Attitudes were sampled 
under the heading My Opinion, which in- 
structed the Ss to report whether they ap- 

*Some of the items were adaptations and 
extensions of material included in tests by 


Furfey (1928), Lehman and Witty (1927), 
Pressey (1933), and Symonds (1928). 


ATTITUDES AND INTERESTS OVER TWO DECADES 





TABLE 4 
TuHINGs TO BE—OccuUPATIONS 
(A Comparison of Responses for 40 Selected 
Cases and for the Total Sample) 


Boys % Positive Response 
1935 1953 





N = 40|\N =72|N = 40 N = 95 


Architect | 45 49 | 37.5| 35 








Athletic Coach | 67.5 | 61 40 42 
Artist 25 31 05 09 
Aviator 67.5 68 42.5 44 
Chemist 137.58| 3 {25 | 2 
College 27.5 | 25 | 05 07 
Teacher 
Dentist 15 17 27.5] 21 
Doctor 32.5 31 35 39 
Engineer 52.5 | 53 | 60 55 
Farmer 42.5 | 37 45 28 
Lawyer | 35 41 | 25 33 
Policeman 27.5 28 17.5 23 





Girls % Positive Response 
1935 1953 





N = 40 |\N = 70'N = 40 \N = 78 





Artist 35 48 22.5 | 22 


Bookkeeper 35 36 | 20 19 
Decorator 20 30 | 45 44 
Doctor 30 22 | 25 22 
Kindergarten 35 33 | 30 35 
Teacher 
Librarian | 22.5; 23 | 20 18 
Nurse 42.5} 41 | 35 36 
Secretary 60 62 | 70 60 
Stenographer 47.5 | 59 | 40 32 
Teacher 25 30 | 22.5| 24 
Telephone 32.5 | 31 | 32.5) 29 
Operator 
Waitress 20 19 | 22.5) 20 


proved, disapproved, or sometimes ap- 
proved the behavior or ideas listed. 

The categories of items as presented were 
administrative rather than functional. 
Items belonging to a given functional clas- 
sification (such as interest in sex-social ac- 
tivities) occurred under different phrasing 
in each of the nine parts of the test. Thus, 
preoccupation with mixed group activities 
and interest in the opposite sex might pre- 
sumably be tapped by the Ss desire to own 
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new party clothes, to go to a good dance 
hall, to want to stay out late, or to talk 
about dates. Similarly, a student might in- 
dicate that he or she went to a dance or 
read a love story last week, and approved 
of “permanents.” 


RESULTS 


One fact is outstanding in our results. 
In the more recent tests, boys and girls in 
the ninth grade marked items in such a 
way as to indicate greater maturity and 
greater social sophistication. This is dem- 
onstrated in their responses to items which 
show an age trend in adolescence for the 
earlier study group. Some of these items, 
representing “juvenile” interests or im- 


mature behavior, are, in the longitudinal 
sample, rejected increasingly as boys and 
girls grow older. At the same time, other 
items which represent greater maturity of 
interests and attitudes become more fre- 
quently selected. For example, members of 
the adolescent study group, as they grew 
older, no longer wanted to “own a stamp 
collection,” to read “Boy’s Life,” to go to 
a “secret clubhouse” or to be “a detective.” 
On the other hand, as they grew older, they 
more often checked such items as this week 
“T went to a dance”; I approve of “think- 
ing about how I look”; I talk about “hav- 
ing dates” (Table 5). 

On such age-relative items, we now find 
that ninth graders in the ’50s are more 


TABLE 5 
SocraL-Sex INTERESTS 
































Boys % Positive Response Girls % Positive Response 
| 1935 1953 1959 1935 1953 1959 
| | 
| N=72 N=7 |N=10| N=10 | N=78 | N= 123 
Activities | 
I took care of my clothes* 9 21* | 2 70 | 86 ‘| 77 
I manicured my nails 0 7* | 16 62 | 74* 63* 
I went to a dance 25 27* | 40 24 41 | 61 
I read a love story 8 3* 5* 30 | 42 | 55 
Interests | 
Would like to own a phonograph 55 60* | 72 50 | «(93 92 
and dance records | 
| } 
Conversations | | 
I talk about having dates | 46 56 63 | 58 | 92 84 
Italk about clothes, thingstowear | 42 57 66 s9 | 94* | 9gi* 
I talk about girl friends | 85 71 74 | | 
I talk about boy friends 65 | 94 | 9% 
| | 
Attitudes | | | 
I approve of using lipstick | 2 6 6| «€666:~«C|:«CU88lC| C88 | 6 
I approve of ‘“‘permanents’’4 29 49 | 42* | 62 72* | 57° 
I approve of thinking about how 27 | 51 | 6 | 40 7 = 6| «C672 
you look | 
I approve of love scenes in movies} 26 | 29* | 37* | 39 | 59 | 8&8 


| 





* Check Yes 


> Never, Seldom, Sometimes, Often, Very often. Per cent equals sum of Sometimes, Often, Very often. 

© Yes, No, Sometimes. 

4 The reduced positive response for both sexes in 1959 probably reflects a change in hairdress fashion. 

* Comparisons not significant at the .05 level or better 1935-1953 and 1935-1959 are starred. Significance levels were 
not determined for the positive response percentages shown in the table, but from chi squares based on all responses 
(e.g., Yes, No, Sometimes). 
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comparable to the eleventh- or twelfth- 
grade students of 20 years earlier than to 
those of their own age (Table 5). 

Interest, activity, and attitude items 
which illustrate this tendency to check the 
more mature response have been grouped 
under subheadings such as social-sex in- 
terests, sense of responsibility, tolerance, 
and antiaggression attitudes. For each of 
these categories, one item which illustrates 
the findings has been selected for discus- 
sion. 

In the area of social-sex or heterosexual 
preoccupation, Fig. 1 shows the response 
for the item J approve of the use of lip- 
stick. For boys and girls respectively, the 
solid and broken lines indicate age trends 
in the longitudinal sample, in the 1930s. 
The responses of ninth graders in 1953 and 
1959 are also indicated; black circles rep- 
resent the percentages for boys, open cir- 
cles the percentages for girls. Lipstick sym- 
bolizes as well as any one specific item 
could, the sensitization in early adolescence 
toward a new sex role and toward being 
grown up. While its use is confined to girls, 
opinions about its use are not, as the fig- 
ure indicates. The lag shown in the data 
for boys (as compared with girls) may in- 
dicate either less interest or active antag- 
onism toward signs of maturing from mem- 
bers of the less mature sex who are being 
pressured into the heterosexual phase 
ahead of schedule. Recorded conversations 
in the 1930s suggest the latter: 


Boy to Girl: Where’s Jane? 

Girl: Down at the beauty parlor getting 
ready for the dance. 

Boy: They all look alike to me whether 
they get fixed up or not. 

Or going down the hall the first day of 
school, Roy says to Bob: There goes Irene. 
And the reply is: All painted up and looks 
like a horse. 

Adult observers of seventh graders in the 
Growth Study considered it worth a notation 
that Shirley had lipstick on. In the high 
eighth grade, however, they commented: 
The Foster sisters were the only ones at the 
party without lipstick. 


ATTITUDES AND INTERESTS OVER TWO DECADES 














3 10 " 
. Grade 


Fic. 1. Approve of using lipstick 


The data show that by the 1950s (1953 
and 1959), for both boys and girls, the per- 
centage approving of the use of lipstick was 
higher in the 9th grade than in the 11th 
grade sample 20 years earlier. It is well 
known that there are social class and re- 
gional differences in attitudes toward the 
use of lipstick and other aspects of adoles- 
cent grooming. But here we find also a tem- 
poral difference in samples living in the 
same neighborhood and comparable in so- 
cial status. 

In a recent study comparing Minnesota 
junior and senior high school student’s re- 
sponses in 1957 to those of Symond’s 
(1936) Tulsa and New York City youth in 
1935, Harris (1959) interprets similar find- 
ings: “Today youth marry younger and 
show earlier interest in social relations, love 
and marriage.” (p. 458) Our own evidence 
of interest in grooming activities among 
ninth graders shows how a temporal trend 
in heterosexual preoccupation may extend 
down into the middle period of adolescence. 

Other items in the category of social-sex 
interest are shown in Table 5. For the most 
part (except for the few comparisons indi- 
cated by asterisks) differences are signifi- 
cant between 1935 and 1953 and between 
1935 and 1959. 

When we described the junior high 
school culture of our group in the 1930s, 
we interpreted their concern with social ac- 
tivities as in conflict with achievement mo- 
tives of an academic nature. This preoccu- 
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Fic. 2. This week I studied 


pation often seemed inconsistent with an 
interest in homework, in assuming respon- 
sibilities or in showing concern for any but 
their own narrow peer group activities 
(Stolz, 1937). Does a similar situation hold 
today? Is an increased social interest ac- 
companied by a decreased commitment in 
other areas? 

Quite the contrary. These 14 year olds 
in 1953 and 1959 can be described from 
their answers as more studious, more 
broadly interested in the contemporary 
scene, more tolerant in their social atti- 
tudes, more inclined to value controlled be- 
havior and to disapprove of aggression or 
irresponsible behavior in others. Compara- 
tive results for the item This week I studied 
are shown in Fig. 2. A significantly higher 
percentage of boys and of girls in the 1950s 
reported studying. The actual percentages 
in the ninth grade were virtually the same 
as those of high school seniors in the 1930s. 
Other items indicating a greater sense of 
personal and social responsibility, as 
marked in the more recent sample, are 
shown in Table 6. There were more fre- 
quent reports of talking about their studies 
and talking about other school activities. 
They liked to go to school assemblies. Such 
responses indicate a more school-centered 
orientation in the ’50s than in the ’30s. 

A greater proportion said they earned 
money, took care of their clothes and their 
rooms, talked about what they were going 
to be when they grew up and thought it 


appropriate to worry about the future 
(Table 6). This, in prosperous times as 
contrasted with depression years and (for 
the 1953 sample) before Sputnik! 

The differences shown in Table 6 may be 
in part differences in the Zeitgeist, and in 
part a matter of situational changes. Thus, 
an increase in the number of young child- 
ren, and the development of baby-sitting, 
may be largely responsible for the increase 
(especially among girls) in “taking care of 
children” and “earning money.” The in- 
creased interest reported by both sexes 
in such items as “reading about science” 
and “talking about political matters” may 
define a generally higher level of intellec- 
tual-cultural maturity in the more recent 
junior high school samples. The markedly 
significant increases especially among girls 
in religious interests may reflect an addi- 
tional factor; the percentages reached in 
the ’50s are higher than any we obtained 
even in the 12th grade in the ’30s (Fig. 3). 

Table 6 presents data for a number of 
items illustrating different aspects of tol- 
erance. As shown in Figure 4, J approve of 
foreigners elicits a much more affirmative 
response in the ’50s, from both boys and 
girls. 

One group of items in Table 7 (approval 
of betting, gambling, etc.) suggests a more 
relaxed attitude which might be thought 
to be incompatible with the greater ex- 
pressed interest in the church and its in- 
fluences (as shown in the preceding table). 
However, this acquiescence does not extend 
to what might be classified as inconsiderate 
or aggressive behavior. The ninth graders 
of the ’50s manifest more disapproval of 
antisocial behavior such as “getting even” 
or “doing what I want when I want to” 
(Fig. 5). 

Some of the responses discussed above 
suggest a greater socialization and also 4 
more conforming attitude. The later char- 
acteristic is frequently attributed to our 
present generation of young people. But 
not all items which connote acquiescence 
are approved by the recent samples as 
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TABLE 7 
TOLERANT ATTITUDES 
Boys % Girls % 
Positive | Positive 


Response} Response 








Approving of foreigners 


Giving criminals another | 9/34/40) 6/22 |39 

chance 
Approving of women '44'72'61|76,90 88* 
doctors 


| 
| 


Tolerance toward 
gambling 
betting on games 
playing cards 
playing cards 
money 
women smoking 


6/21/34) 2) 7*| 5 
25/40/47|17|13*|/23* 
(64/86|77|66 72*|71* 
for |18/33/3411) 5*|12* 


18)35)31/26/35*|22 





* Comparisons with 1935 are starred for chi squares 
not significant at the .05 level or better. 
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Fic. 5. Getting even 


which he applauded was “the growth of 
activities” (p. 187). 


DIscussIoN 


There is little evidence in the recent lit- 
erature to support the rather favorable 
contemporary picture of adolescents as re- 
ported here. Harris’ 1957 study indicates 
that Minnesota young people are some- 
what more concerned about study-habits 
than were students in 1935 but that they 


are not much “interested.” Similarly, they 
seemed unconcerned about civic affairs. 
Remmers and Radler (1957) lament that 
teenagers, rebellious of adult authority in 
many areas, nevertheless tend to follow the 
example of adults in being illiberal and 
conformist. 

Other studies based on attitudes ques- 
tionnaires have found (among college stu- 
dents) considerable evidence of social con- 
science and of toleration for nonconforming 
behavior and ideas (Jacob, 1957; Sanford, 
1956; Webster, 1958). At the same time 
these Ss were reported to show little con- 
cern about social issues. 

The data summarized here are for a 
small sample in a single geographic location 
representing a limited range of social sta- 
tus. But if we believe what our Ss tell us, 
all is not lost for youngsters growing up in 
the ’50s. 


Sex DIFFERENCES AND Sex Roe 
PATTERNS 


Check lists of children’s interests, such 
as those employed in this study, have been 
used by a number of investigators to ex- 
amine the development of sex role con- 
cepts and sex differences. For most of the 
items in the area of social-sex development 
(listed in Table 5) the indication of mature 
interest is more marked in girls than in 
boys. More girls than boys report “going 
to a dance.” More girls than boys report 
that they talk about dates and opposite- 
sex friends. 

Earlier physiological maturing of girls 
(as compared with boys) would be ex- 
pected to produce now, as formerly, an 
earlier maturing in social-sex development. 
However, it may be that the temporal 
trend in the direction of a generally earlier 
expression of heterosexual interest for both 
sexes, contributes to some blurring of sex 
role patterns as revealed in certain aspects 
of the data for the 1950s. 

For example, this may account for the 
fact that in the 1950s boys’ responses are 
more like those of girls on some items. 
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More girls than boys checked that they 
manicured their nails, but between 1935 
and 1959 the increase in percentage check- 
ing this item was significant only for boys.‘ 

The boys’ reduced interest in some as- 
pects of scientific and mechanical manipu- 
lation (usually considered a masculine in- 
terest)" may be a temporary expression of 
earlier social maturing. To indicate inter- 
est in experimenting with a chemistry set 
or with electricity may at a certain age 
seem juvenile rather than socially mature. 
The percentages for the item experiment- 
ing with chemistry are shown in Fig. 6. The 
drop in interest for ninth-grade boys is 
significant at the .05 level in 1953 and in 
the same direction, though not significant 
at that level, in 1959. The data from both 
testings in the ’50s indicate less interest 
in such matters than at any age in the ’30s, 
even though the longitudinal sample main- 
tained a downward trend with age in items 
of this nature. 

Although greater social maturity in both 
boys and girls may be one factor which ac- 
counts for increased similarity of the sexes 
on some items, there is one area of interest 
in which results run counter to this ob- 
servation. The girls seem to have made a 
striking shift from passive to active inter- 
est in sports and athletic activities.® In the 


*The writer, who administered the tests 
on both occasions, can corroborate from ob- 
servational notes the evidence of increased 
grooming activities in boys. Increased hair 
grooming was especially in evidence. 

*In 1958 Marshall used some of the items 
of the Interest Inventory to indicate aware- 
ness of appropriate or inappropriate sex-roles 
in fifth-grade boys and girls. Both boys and 
girls rated “experimenting with electricity” 
as something which boys would like and girls 
would dislike. 

* This is the only finding in the present in- 
vestigation which is not supported by the 
school personnel from their observation and 
experience. The same teacher has been in 
charge of the girls’ physical education pro- 
gram since the early thirties; she could offer 
no explanation of any results on the basis of 
curriculum change or emphasis. However, 
several teachers and administrators cited the 
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earlier sample the percentage checking the 
positive response diminished from the 6th 
to the 12th grade. There was also a large 
sex difference in the 1935 sample, with boys 
showing more favorable attitudes. 

Figure 7 gives the results for a represen- 
tative item, showing the trend in percent- 
age wishing to own a baseball and bat. 
Data are given for the 6th through the 12th 
grades in the ’30s and also for the 9th grade 
in 1953 and 1959. Differences are significant 
for girls but not for boys (Table 8). 

Recent studies by Rosenberg and Sutton- 





effective city recreational program which has 
been built up over this period, as a possible 
contributing factor. 

A study by Meleny (1959) comparing the 
motor performance of eighth-grade girls now 
as compared to our longitudinal sample 24 
years ago found no striking differences in 
these abilities. The more recent sample how- 
ever was significantly taller and heavier. 
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TABLE 8 
INTEREST IN ATHLETICS 
Boys % Girls % 
Positive Positive 
| Response | Response 








Activities i) no 
I watched athletic (6567*62*/24/46/41 
games ae 
I played athletic (77/80*|81*|28)/79)/57 
games 


Occupations | | 
Scout leader* \44/20 |12 


Things to Own 
Baseball and bat 





Things to Do | | 
Be on a school team = /74/75*|74*|34'63/41 
Go to a playground  /74|59*|57*|/36 55/43* 

- | 





® Differences are significant for the two sexes but in 
the opposite direction. 


Smith (in press) and Rosenberg and 
Sutton-Smith (1959) of children’s interests 
in games has compared the responses of 
boys and girls in this generation with the 
responses of each sex to similar question- 
naire items in the 1920’s (Terman, 1926, 
1936). The authors report results for the 
fourth, fifth, and sixth grades which indi- 
cate that there has been a shift in the 
direction of increasing preferences of girls 
for what were formerly boys’ games while 
boys tend to select fewer items which sig- 
nificantly differentiate their choices from 
those of girls. The authors interpret their 
findings as an indication that the female 
role perception is expanding in the direc- 
tion of greater activity and increased mas- 
culinity. This tendency is shown in the 
present study for the responses listed in 
Table 8. Not only preferences but recent 
participation is reported (see also Table 
9). 

The question has been raised as to what 
the effect on the masculine role will be, if 
girls’ interests are expanded to include tra- 
ditionally masculine activities (Hartley, 


1959a, 1959b). We might conjecture that, 
for this sample at least, boys may be stak- 
ing out a new claim in the borderland of 
social respectability involving betting and 
gambling (Table 7). On the other hand, 
the increased heterosexual orientation of 
ninth-grade boys suggests that they may 
not be averse to sharing interests in some 
areas with girls. 

The overall picture indicates no striking 
changes in the number of items eliciting 
sex differences within each temporal sam- 
ple. Comparing the significant changes for 
each sex for the two samples (1935-1953) 
we find boys’ responses showing greater 
changes over time, with occupational re- 
jections by the recent sample accounting 
for rauch of this disparity (Table 9). Girls 
give relatively more positive total re- 
sponses in the later sample. 

Tyler (1955) has suggested that aware- 


TABLE 9 


Sex DIFFERENCES: SIGNIFICANT 
DIFFERENCES IN PREFERENCES, 
ETc. 1935-53 


(Test items with significant differences) 









































Boys Girls 
No.of 8 | $8] 8] 8 
Test Category } es 33 ws 33 33 
|test | 28 | 38 28) 38 
| Eh \a* |e z< 
Things I did last} 20/1 |1 |8 |0 
week 
Where I went 16 | 0 ‘2:8 12 
Things Iread | 18/2 3 16/1 
Things to do | 23 3 4/6 /1 
Occupations 47 | 0 23 |2/9 
Things you talk | 26 9 | 3 |9 |0 
about ‘ 
Places to go BE &eé 14/3 16 
Things toown | 17/1 112 2 
Magazines to| 17 | 110 |} 1 |6 
read | 
My opinions 35 - 2 jl | 2 
| 
Totals | 242 32 «= |79 ‘(53/28 


[12.5%|33%|21 % 12% 





Note.—For boys, 111 items (45% of total) showed sig- 
nificant differences; for girls, 81 items (33% of total) 
showed significant differences. 
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ness of role “enables the individual to rule 
out activities and goals not in keeping with 
it.” In her longitudinal sample, Ss disliked 
more things as they grew older. Our own 
longitudinal results support her findings to 
some extent in the case of girls but not in 
the case of boys. In the area of vocational 
choices, for example, girls decreased their 
likes significantly on 20% of the items and 
increased them on 8%, from the 6th 
through the 12th grade. Boys, on the other 
hand, decreased their likes significantly on 
only 4% of the items, and increased them 
on 15%. 

The maturity status of ninth-grade girls, 
higher than that of their male classmates, 
may be a factor in these findings. The na- 
ture of the items, the age of the Ss, and 
cultural differences, would also be deter- 
mining factors in a comparison with other 
studies. 

In relation to psychological sex-role de- 
velopment the data are not definitive. 
There is some blending of sex-role patterns 
but only as it is expressed concomitantly 
with earlier social maturing. There is some 
expansion of feminine interests into the 
male domain, but this is principally seen 
in sports activities. In the area of voca- 
tional preferences, girls’ interests seem to 
be oriented toward supplementing the fam- 
ily income rather than toward a career. 
There is some constriction of the boys’ 
vocational choices but this may be because 
we are comparing good times with the ear- 
lier depression era. 

Members of the early growth study are 
now adults, some with adolescent offspring 
of their own. In interviews they often ex- 
press their opinions about today’s young 
people in contrast to their own remem- 
bered youth. They do not need to be told 
that adolescents now are more mature so- 
cially than in their day, though they may 
exaggerate the differences. In response to 
an interview question in this area, one of 
these young men said: 


I was early-maturing but not by today’s 
standards. I think part of it is the way they 
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dress now. The girls wear lipstick and look 
older. If they had dressed that way when I 
was a kid, I would have been sure they would 
bite me. 


There may be some relationship be- 
tween the frequent complaint of these 
study members in interviews: “Why didn’t 
someone make me study more when we 
were in school?” and our findings that 
ninth graders say they do study more now. 
Strange as it may seem, one young man’s 
statement: “I didn’t know that the rea- 
son I was getting poor grades was that I 
didn’t study” is not an isolated remark. 
Having become aware of this sequential 
relationship, such individuals now apply 
parental pressures for “more studying.” 

So in a sense, the present attitudes of 
our original sample, and their contempo- 
raries, now parents and citizens, are among 
the environmental factors affecting changes 
in the directions which we have seen. Par- 
ental influences, evidence of earlier matur- 
ing (Meleny, 1959; Tanner, 1955) altered 
school programs, historical events, all give 
direction to the secular trends in interests 
and attitudes recorded in this study. 


SuMMARY 


This report compares the activities, in- 
terests and attitudes of three ninth-grade 
classes in the same school but separated 
in time by 18 and 24 years (1935, 1953, 
1959). The samples are closely similar in 
age and IQ distributions, and fairly com- 
parable in a number of social measures. 
The earlier group, members of the Oak- 
land Growth Study, provided age-trend 
data through tests given at regular inter- 
vals over a seven-year period. 

The more recent generations of ninth 
graders indicate greater maturity of het- 
erosexual interests, more serious purpose 
and a more tolerant attitude toward social 
issues. They reject more “childish” activi- 
ties. 

Alterations over time in sex-role devel- 
opment are not striking. There is some 
blending of sex-role patterns in the more 
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recent samples as expressed concomitantly 
with earlier social maturing. Currently, 
girls’ interests have expanded in the area of 
sports activities. The responses of boys in 
the ’50s indicate rejection of a greater 
number of occupations. Perhaps this is a 
reflection of occupational preferences as 
expressed during full employment as 
contrasted with the depression years. A 
greater verbal acceptance of “careers for 
women” is not accompanied by eagerness 
for a career among ninth-grade girls in the, 
present generation. A variety of influences 
must be considered in accounting for the 
changes in interests and attitudes over the 
period of the study. 
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Although many attempts have been 
made to develop abbreviated scales or 
short-forms of the Wechsler-Bellevue and 
the WAIS, there have been only a few 
studies of this type dealing with the WISC. 
Geuting (1959) reported the most valid 
(i.e., having the highest correlation with 
the Full Seale score) triads and tetrads of 
the WISC at the 742, 10%, and 13% 
year-old levels, based on Wechsler’s own 
standardization data. The short-form va- 
lidities were almost all between .90 and 
95. On the other hand, Yalowitz and Arm- 
strong (1955) found poor short-form va- 
lidities for a heterogeneous child guidance 
clinic sample ranging in age from 5 to 15 
years. Their highest coefficient was only 
61. Carleton and Stacey (1954), using a 
sample of “suspected” mental defectives 
and emotionally disturbed children, ob- 
tained validities ranging from .64 to 88 
for selected short-forms. A study of short- 
forms of the WISC with institutionalized 
mentally retarded boys was recently com- 
pleted by Smith (1959). His report fur- 
nished data from which the correlation of 
any combination of subtests can be derived 
by means of multiple regression equations. 
The average IQ in the sample was 75, but 
the range indicated that about 40% of 
Smith’s subjects would probably not be 
classifiable as mentally retarded. 

The present report concerns a compre- 


*This study was supported by a grant 
from the Indianapolis Foundation of Indian- 
apolis, Indiana, with the approval of Her- 
man Shibler, then general superintendent of 
education, Indianapolis Public Schools. 
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hensive investigation of a large number of 
short-forms of the WISC with an educable, 
mentally retarded, noninstitutionalized 
group of children. It grew out of the need 
for developing short-forms for quick re- 
evaluation of children in special classes 
for the mentally retarded in the Indian- 
apolis public school system. It provides 
information whereby the composition of 
WISC short-forms which are useful for 
this purpose may be obtained. 


SUBJECTS AND PROCEDURE 


The sample consisted of an unselected 
group of 177 children tested by the same 
psychologist and found to be eligible for 
placement in special classes for the educa- 
ble, mentally retarded in the Indianapolis 
school system. There were 112 boys and 
67 girls. Sixty-eight percent of the children 
were Negro, and 32% were white. About 
90% of the children came from schools in 
lower and lower middle-class neighbor- 
hoods. The age range was from 9 years and 
10 months to 15 years and 11 months 
with a mean of 13 years and 4 months. 
The range of Full Scale intelligence quo- 
tients on the WISC was 50 to 80 with a 
mean of 70. 

The customary 10 subtests of the WISC 
were administered to 145 of the children. 
In the 32 instances where one subtest was 
omitted, an average value was assigned. 

Statistical analyses of data were car- 
ried out on IBM equipment by the Re- 
search Computing Center of Indiana Uni- 
versity. Correlations between the Full 
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Scale score on the one hand, and each in- 
dividual subtest, and the sum of subtest 
scores for all possible duads, triads, tetrads, 
pentads, and hexads of subtests were com- 
puted. There were 45 duads, 120 triads, 
210 tetrads, 252 pentads, and 210 hexads, 
making a total of 847 coefficients. Correla- 
tions for combinations of subtests greater 
than 6 were not computed since the econ- 
omy in using a short-form of more than 
6 subtests instead of the Full Scale would 
be too small. 
RESULTS 

The correlations of individual subtests 
with the Full Scale range from .617 for 
PA? to .438 for CO. They are generally .10 
to .15 correlation units below the com- 
parable correlations reported for the stand- 
ardization sample (Wechsler, 1949), but 
the relative magnitudes are similar. The 
rank order correlation of size of the co- 
efficient between our results and the stand- 
ardization data is .60, p = .05. 

The customary procedure in a study of 
short-forms is to present the group of 
duads, triads, etc. having the highest cor- 
relations with the Full Scale. This does 
not seem to be a feasible approach in this 
paper. The range of coefficients of short- 
forms of each type is restricted. For ex- 
ample, the largest hexad coefficient is .945, 
while the 25th largest hexad coefficient is 
.932. The largest pentad coefficient is .920 
and the 25th largest is .903, and so on. The 
differences tend to become more pro- 
nounced with smaller numbers of sub- 
tests—the range of the 25 largest duad 
coefficients is .11—but they are still rela- 
tively trivial. 

Considerations other than validity are 
often of consequence in choosing a short- 
form. Administration time, handicaps in 


* Abbreviations for subtests used in this 
report are conventional: I (Information), 
C (Comprehension), A (Arithmetic), V (Vo- 
cabulary), S (Similarities), PA (Picture Ar- 
rangement), PC (Picture Completion), BD 
(Block Design), OA (Object Assembly), and 
C (Coding). 
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the child, or the examiner’s personal prefer- 
ences for particular subtests may dictate 
the choice. When the range of validity 
coefficients is limited, these considerations 
can be allowed heavier weight. It would 
therefore be misleading to present any 
small group of short-forms of those having 
the highest correlations with the Full 
Scale, which would imply that these are 
recommended for use by the school psy- 
chologist. Rather, we have elected to re- 
port a selection of five of each type of 
short-form, chosen from among the top 
two dozen correlations with the Full Scale. 
An attempt was made to include short- 
forms which varied maximally in composi- 
tion. These will serve simply to give some 
idea of the disparity in coefficient size 
among the various types of subtests, and 
between our results and those of previous 
investigators. A more detailed account of 
our findings is on file and can be obtained 
by interested individuals.’ 

The 25 short-forms which we have se- 
lected are listed in Table 1 along with the 
respective correlations with the Full Scale 
score, and regression equations for pre- 
dicting the Full Scale score from the short- 
form in scaled score (not IQ) units. The 
IQ can then be determined from Table XI 
B of the WISC Manual (Wechsler, 1949). 

In general, the short-form validities we 
obtained are somewhat lower than those 
found by Geuting (1959) for normal chil- 
dren, and slightly lower than those re- 
ported by Carleton and Stacey (1954) for 
their mixed sample. This is about what one 
would expect; the correlations are largest 
when the sample is most heterogeneous, 


* Tables listing the most valid 10 duads, 
15 triads, 15 tetrads, 26 pentads, and 28 
hexads, with a regression equation for each, 
have been deposited with the American 
Documentation Institute. Order Number 
6250, from ADI Auxilliary Publications 
Project, Photoduplication Service, Library 
of Congress, Washington 25, D. C., remitting 
in advance $1.25 for microfilm, $1.25 for 
photocopies. Make checks payable to Chief, 
Photoduplication Service, Library of Con- 
gress. 
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and tend to become attenuated as the 
sample becomes more homogeneous. 


DIscussIon 

There is no completely objective way to 
establish the entitlements of an adequate 
short-form of an intelligence test. The es- 
timated magnitude of the error made in 
using a short-form instead of the Full 
Scale depends upon the variability of Full 
Scale scores, the correlation between them 
and the short-form scores, and the level of 
confidence with which we wish to make the 
predictions. In this study, the SD of the 
Full Seale scores is 10.04 in scaled score 
units. Using the customary .05 level of 
confidence, the estimated error when the 
correlation is .95 is +6.1 scaled score units. 
For a correlation of .90, it is 8.6 scaled 
score units, for a correlation of .85, 10.4 
units, and for a correlation of .80, 11.8 
units. 

The applied user of the short-form is, of 
course, free to decide for himself how much 
risk he is willing to take. But it would cer- 
tainly seem that a potential error of 10 
scaled score points is too high a price to 
pay for economy. If one hence insists on a 
correlation of .90 or higher, none of the 
duads and triads, and only a few tetrads, 
are useable. Only pentads and hexads 
would qualify. 

The applied use of our results should 
properly be restricted to children of ap- 
proximately the same description as those 
in our sample. This means that the tester 
must have some a priori knowledge of the 
child’s intellectual level, which probably 
precludes the use of any short-form for an 
original intellectual evaluation. The pur- 
pose of the development of these short- 
forms was to furnish a method of re- 
evaluating the intelligence of known 
mentally retarded children, as in school 
systems where this must be done periodi- 
cally. Such groups of children in most 
metropolitan areas and large towns will 
probably conform to the specifications of 
our sample. 
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TABLE 1 


A SAMPLING or THE Most Vatip WISC 
Suort-Forms or Eacu Type 


Correlation with Regression Eq- 





Subtests Full Scale Score uation:* (Y =) 
APA .789 25X + 31.05 
SPA -742 2.27X + 32.88 
VBD -741 2.86X + 25.72 
PA OA -722 2.05X + 35.42 
AV 717 2.69X + 28.35 
AVPA .843 2.09X + 23.00 
A PAOA 834 1.96X + 25.66 
SV BD . 823 2.37X + 17.60 
VPA BD .817 1.97X + 24.66 
CAPA -812 2.00X + 23.41 
IA PAOA . 893 1.87X + 18.05 
AS PABD 879 1.74X + 19.50 
VPC PACO .878 1.74X + 16.14 
CAPABD .876 1.78X + 17.15 
8 PC PACO .872 1.69X + 17.74 
AV PC PACO -920 1.55X + 12.32 
IAS PAOA -916 1.61X + 14.37 
ASVPABD -915 1.57X + 13.86 
SV PC BDCO -913 1.69X + 7.68 
Vv PC PABDCO 905 1.50X + 13.50 
SV PC PABDCO 945 139X + 8.72 
ASV PC PACO -943 139X + 9.04 
C APC PAOA CO -943 139X + 8.19 
ICA PABDOA -938 145X + 9.37 
CSV PCBDCO 936 153X + 2.46 





* To obtain the estimated Full Scale score predicted 
by a short-form, substitute the S’s short-form score for 
X in the regression equation and solve for Y (the pre- 
dicted Full Scale score). This gives the Full Scale score 
in sealed score units. The IQ can then be determined 
from Table XI on page 26 of the WISC Manual (1949). 


SuMMARY 


This study was an attempt to develop 
short-forms of the Wechsler Intelligence 
Scale for Children for use in re-evaluating 
educable, noninstitutionalized, mentally 
retarded children. Correlations were ob- 
tained between the Full Scale WISC score 
and the individual subtests, and all pos- 
sible duads, triads, tetrads, pentads, and 
hexads of subtests for a group of such 
children. The statistical computations sug- 
gest that only pentad and hexad short- 
forms can be substituted for the Full Scale 
with relatively small loss in accuracy of 
prediction. 
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GRADE DIFFERENCES IN TRANSFER AS A FUNCTION OF 
THE TIME INTERVAL BETWEEN LEARNING TASKS 


MICHAEL GLADIS* 
Johnstown College, University of Pittsburgh 


The purpose of this study was to in- 
vestigate transfer in third-, fifth-, and 
seventh-grade children in a verbal learn- 
ing task when the time interval between 
Task I and Task II was varied. As such, 
it was stimulated by the incomplete nature 
of the evidence concerning changes in 
transfer as a function of age and time. 
Generalizations can be found in various 
textbooks stating that transfer increases 
with age in children but there appears to 
be little supporting evidence as to the 
nature and kinds of increases. In regards 
to the variable of the time interval be- 
tween the first and second learning tasks, 
the evidence is such that it has led Mc- 
Geoch and Irion (1956) to state that 


there are many problems connected with 
the retention of transfer effects which need 
to be worked out. In particular, the relation- 
ship between time and transfer needs to be 
investigated over a considerably wider range 
of learning situations. (McGeoch & Irion, p. 
341). 


METHOD 


Subjects. Three groups of 45 Ss each 
were taken from the third, fifth, and sev- 
enth grades. The fifth-grade students rep- 
resented the entire class of students at the 
Southmont Grammar School.? The seventh- 
grade children were selected randomly 
from some 130 students in this grade at 


‘Now at Veterans Administration Hospi- 
tal, Leech Farm Road, Pittsburgh, Pennsyl- 
vania. 

*The author wishes to express his grati- 
tude to Thomas Kochuba, Principal of the 
Southmont Grammar School, for his interest 
and invaluable assistance throughout the 
course of the investigation as well as to the 
Officials and teachers of the Southmont and 
Tioga Street Schools for their cooperation. 
He also wishes to thank A. W. Bendig for 
his advice on statistical procedures 
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the same school. Since there were only 38 
third-grade students enrolled in the South- 
mont School, an additional seven were ob- 
tained from the Tioga Street School in 
Westmont, which is a neighboring suburb. 
While in each grade level there were sev- 
eral children younger and older than the 
norm age, the average age for each group 
closely approximated 8, 10, and 12 years. 
Design. Ten Ss from each grade level 
were assigned randomly to one of three 
experimental conditions. The three experi- 
mental conditions were based on differ- 
ences in the time intervals between the 
learning of Task I and Task II. The three 
time intervals were 5 minutes, 2 days, 
and 14 days. These groups will be desig- 
nated as 0, 2, and 14 respectively. The 
remaining 15 Ss from each grade level 
comprised the control group for that grade. 
Learning materials and apparatus. The 
learning material consisted of five pairs of 
one syllable four letter nouns. The words 
were selected from a second-grade reading 
list to rule out problems of unfamiliar- 
ity and reading difficulty. The second 
learning task contained the same response 
words as in the first list but the stimulus 
words were changed. Therefore, the trans- 
fer design was of the A-B, C-B type. The 
learning material was presented on a Hull- 
type memory drum. Each stimulus word 
appeared alone for 4 sec. and then the 
stimulus and response words appeared to- 
gether for 4 sec. The intertrial interval 
was 8 sec. Five different orders of the lists 
were used to control for serial effects. 
Procedure. After each experimental S 
was given the usual instructions for paired- 
associate learning, they learned a practice 
list which consisted of 3 pairs of one 
syllable nouns. They then learned Task I. 
Learning on Task I was carried out to one 
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trial beyond the first perfect repetition. 
Group 0 was given a 5-min. rest period 
during which they were permitted to 
browse in the library where the experiment 
was conducted. They then relearned the 
practice task and went on to learn Task 
II. The criterion of learning for Task II 
was one perfect repetition. The same pro- 
cedure was utilized with Groups 2 and 14 
except that the time interval between Task 
I and Task II was 2 and 14 days respec- 
tively. The control groups learned the 
practice task and then immediately learned 
Task II to a criterion of one perfect repe- 
tition. 


REsvULTsS 


Learning on Task I. The mean number 
of trials to the criterion on Task I for the 
third-, fifth- and seventh-grade experi- 
mental groups was respectively 16.2, 9.4, 
and 10.3. An analysis of variance of the 
learning scores on Task I indicated that 
there was a significant difference between 
the grade group means at the .01 level. 
Separate ¢ tests indicated that the mean 
number of trials for the third-grade group 
was significantly greater than the fifth- 
and seventh-grade means, but the differ- 
ence between the fifth- and seventh-grade 
means was not significant. The mean num- 
ber of trials to the criterion for the time 
interval groups did not differ significantly. 

Learning on Task II. The mean number 
of trials to the criterion on Task II for 
each of the experimental and control 
groups is presented in Table 1. The mean 


TABLE 1 
Mean NuMBER OF TRIALS TO THE 
CRITERION FOR THE EXPERIMENTAL 
AND ConTROL GROUPS IN 
LEARNING Task II 








Time Interval 








-,. Control 
Grade 0 2 14 Composite Group 
Third 12.9 13.1 14.0 13.0 10.7 
Fifth 7.4 8.6 7.0 7.7 7.6 
Seventh 4.8 9.4 6.7 7.0 8.5 
Com posite 8.4 10.4 9.2 
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TABLE 2 
MEAN TRANSFER Scores FOR EAcH OF THE 
EXPERIMENTAL SUBGROUPS 








Time Interval 








Grade 0 2 14 Composite 
Third —3.3 —5.5 —10.5 —6.4 
Fifth 8.7 4.4 7.6 6.9 
Seventh 29.6 2.6 24.4 18.9 
Composite 11.7 0.5 7.2 





scores for the third-, fifth- and seventh- 
grade experimental groups were 13.3, 7.7, 
and 7.0. An analysis of variance of the 
learning scores on Task II indicated that 
there was a significant difference between 
the grade group means at the .01 level. 
Separate t tests indicated that the mean 
number of trials for both the fifth and 
seventh grades was significantly less than 
the third-grade mean, but again the dif- 
ference between the fifth- and seventh- 
grade means was not significant. The time 
interval group means also did not differ 
significantly. 

The control group means for the third, 
fifth, and seventh grades respectively were 
10.7, 7.6, and 8.5. While the trend of these 
mean scores is similar to that of the 
experimental groups on Task I and II, 
they did not differ significantly at the .05 
level. 

Transfer effects. To obtain an estimate 
of transfer for each S the formula 


Mean score of the Control group* 
— Experimental S’s score 





Mean score of the Control group 
+ Experimental S’s score 


was utilized. This formula was used since 
it has the major advantage that the limit 
for both positive and negative transfer is 
100% and is derived from one recom- 
mended by Murdoch (1957). 

Table 3 presents a summary of the 
analysis of variance of the transfer scores. 


*Each grade group has its own control 
group. 
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A difference significant at the .01 level was 
obtained between the grade group means 
which were —6.4, 6.9, and 18.9 for the 
third, fifth, and seventh grades respec- 
tively. Separate t tests indicated that both 
the fifth- and seventh-grade mean transfer 
scores differed significantly at less than 
the .05 level from the third-grade mean, 
but the difference between the fifth- and 
seventh-grade means did not reach the .05 
level of significance. Furthermore, the 
analysis indicated that the time interval 
groups did not differ significantly, nor was 
the interaction between grade level and 
time interval significant. 


DIscussIoN 


The major finding of this study was that 
a significant difference in amount of trans- 
fer was obtained among the three grade 
level subjects used in this study. The 
trend of the changes in transfer among the 
three grade levels suggests that the kind 
and amount of transfer obtained may be 
a function of developmental factors, since 
the mean transfer score for the third- 
grade group was negative while it was 
positive for the fifth- and seventh-grade 
groups, with the seventh grade showing 
more transfer than the fifth-grade group. 
An individual analysis of the data indi- 
cated that 18 out of the 30 third-grade 
Ss showed negative transfer while only 13 
of the fifth-grade Ss and 6 of the seventh- 
grade group evidenced negative transfer. 
It is interesting to note the occurrence of 
negative transfer in view of the fact that 
the transfer design, A-B, C-B, was one in 
which the prediction from Osgood’s sur- 
face (1949) would be that positive trans- 
fer would occur. The results of this study 
suggest that the generalization that posi- 
tive transfer will occur when the responses 
are identical but the stimuli change from 
Task I to Task II might not be a valid one 
for all age groups. 

The failure to obtain any significant dif- 
ferences among the various time intervals 
used between Task I and Task II is con- 
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TABLE 3 
ANALYSIS OF VARIANCE OF 
TRANSFER SCORES 














Source df MS F 
Between Groups 8 1760.8 2.76° 
Grades (G) 2 4805.0 7.53°° 
Time Interval (T) 2 948.5 1.49 
GxT + 645.0 1.01 
Within Groups 81 638.0 
Total 89 
*P< 06. 
CP < Mi. 


sistent with findings by Bunch (1938) and 
Bunch & McGraven (1936) who, using 
college students as Ss, also failed to obtain 
any significant differences. It may be that 
the time intervals used in this study, 5 
min., 2 days, and 14 days, which were 
similar to those used by the above-men- 
tioned investigators, are too long to pick 
up any effects, since Hamilton (1950) 
found that the number of correct antici- 
pations tended to decrease as the length of 
the time interval between Tasks I and II 
increased from 8 sec. to one hour but 
showed little change for longer periods of 
time. It should be noted that in the pres- 
ent study the least amount of transfer 
occurred with the 2-day group whose mean 
transfer score was only .5 in contrast to 
the mean scores of 11.7 and 7.2 made by 
the 0 and 14-day groups. Further studies 
using different time intervals appear neces- 
sary to clarify the issue. 

A final point is that a larger difference in 
the rate of learning was found between 
the third- and fifth-grade groups than be- 
tween the fifth- and seventh-grade groups. 
The rate of learning on Tasks I and II for 
the third-grade experimental group was 
significantly slower than the fifth- and 
seventh-grade experimental groups. Fur- 
thermore, the difference between the third- 
grade control group and the fifth-grade 
control group, while not quite reaching 
the .05 level of significance, was larger 
than the difference between the fifth- and 
seventh-grade control groups. Actually, 
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the mean score for the fifth-grade experi- 
mental group was less than the mean 
score of the seventh-grade experimen- 
tal group on Task I, and the same 
trend was found between the fifth- and 
seventh-grade control group means. In 
general, then, all this seems to suggest that 
the difficulty level of the learning tasks 
used in this study is such that peak per- 
formance is probably attained by the time 
the children have reached the fifth grade. 


SUMMARY 


This study investigated transfer in 
third-, fifth-, and seventh-grade Ss in a 
verbal learning task when the time inter- 
val between Task I and Task II was varied 
through the three conditions of 5 min., 2 
days, and 14 days. The learning material 
consisted of meaningful paired associates 
and the transfer paradigm was of the 
A-B, C-B type. A significant difference in 
the amount of transfer was obtained be- 
tween the three grade groups. The mean 
transfer scores for the fifth- and seventh- 
grade groups were positive and signifi- 
cantly different from the mean transfer 


score of the third-grade group which was 
negative. No significant differences were 
obtained between the different time inter- 
vals. The results were discussed in terms 
of their implication for transfer theory and 
related to previous findings in this area. 
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STABILITY OF ITEM STATISTICS IN TWO INSTITUTIONS 
ERNEST R. HILGARD 


Stanford University 


anp EARL L. JANDRON 
San Jose State College 


Although the theory of test construction 
has now reached an advanced stage (e.g., 
Gulliksen, 1950) course examinations in 
colleges and universities, including those 
that consist of objective test items, often 
fall short of desirable test characteristics. 
Test theory alone will not produce good 
examinations; it is necessary to give the 
items careful local consideration if the test 
is to meet educational as well as measure- 
ment objectives. The present paper bears 
on this problem by presenting some experi- 
ences in the testing of the same items at 
two institutions in which arrangements for 
the course differ widely. 

The course selected for study was in- 
troductory psychology. At San Jose State 
College the course is taught in small sec- 
tions meeting three times a week for a 
semester, and the course with which we are 
concerned is limited to first-semester 
freshmen. At Stanford the course is taught 
in a large lecture meeting four times a 
week for a quarter, with a one-hour dis- 
cussion period in small groups conducted 
by teaching assistants. Most of those who 
take the course are sophomores, although 
there are a number of upper classmen and 
a few freshmen. The purpose obviously 
was not to study the comparability of the 
outcomes of the courses at San Jose State 
College and at Stanford, but rather to see 
whether or not the same questions were 
appropriate for use under such different 
conditions of teaching. Both sets of classes 
used the same textbook as part of the 
course requirement, although the outside 
readings differed widely both in kind and 
amount. 
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PROCEDURES 


Two examinations were prepared and 
used as final course examinations. Exami- 
nation 1, consisting of 230 questions, was 
administered to 221 students at Stanford 
in December 1958. Examination 2, also 
consisting of 230 questions, was adminis- 
tered to 125 students at San Jose in Janu- 
ary 1959. Of the 230 questions on each of 
these examinations, 115 were in common to 
the two examinations. All questions were 
of multiple-choice type, the correct answer 
being selected from four alternatives. 

While questions were carefully reviewed 
and edited, they had not previously been 
tested. They had been constructed in order 
to add to an existing item pool, and this 
constraint upon them led to the inclusion 
of some specialized items that increased 
the average level of difficulty beyond the 
level the authors consider desirable. The 
examinations, taken as a unit, were rea- 
sonably satisfactory, despite the fact that 
there were many questions included that 
were rejected following the pretest. The 
reliabilities (Kuder-Richardson) were for 
Examination 1, .82, and for Examination 
2, .65. The lower reliability for the San 
Jose group (Examination 2) is probably 
a function of item difficulty, the items be- 
ing somewhat too hard for the freshmen 
students inexperienced in this kind of ex- 
amination. 

Item analysis led to two measures: 

1. Item difficulty. The number of cor- 
rect replies was determined for each ques- 
tion and expressed as a per cent. This 
served as a measure of item difficulty, the 
higher per cent being, of course, the easier 
items. 
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2. Discrimination index. A discrimina- 
tion index was computed for each item. 
This is a tetrachoric correlation between 
the single item and the total score on the 
examination. It is computed by determin- 
ing the percentage successful on the item 
in each of two fractions of the whole ex- 
amination, the upper 27% and the lower 
27%. Then the tetrachoric correlation is 
looked up in an item analysis table (Fan, 
1952). 

Our results concern the stability of these 
two measures between the two college pop- 
ulations, and the relationships between 
the two measures. 


CoNSISTENCY OF RELATIVE 
Item DIFFICULTY 


Many differences in student selection, 
in stage of academic career, in time de- 
voted to individual topics, contributed to 
a different average performance on the 
questions in common at the two teaching 
centers, but this is an advantage for the 
purposes of our analysis. At San Jose the 
mean per cent passing each of the 115 
questions was 47.6, with a standard devia- 
tion of 20.6; at Stanford the mean per cent 
passing was 63.5, with a standard deviation 
of 19.2. This yields a highly significant dif- 
ference between means (CR = 10.7). The 
distributions are of the questions accord- 
ing to their difficulty; a related, but dif- 
ferent, statistic would be found for differ- 
ences in mean scores for the students at 
the two places. 

Despite the differences in average diffi- 
culty level, the relative difficulty of items 
at the two places was similar. The disper- 
sions were similar, and the two distribu- 
tions of difficulty were correlated to the 
extent of r = +.80. These results confirm 
an earlier report in which a correlation of 
.87 was found between item difficulty at the 
University of Michigan and at Rutgers 
University for the same 80 items used at 
both places (Hilgard, Teevan, & Wells, 
1954.) This relative consistency means that 
one college or university can profit by using 


items pretested at another, even though 
local circumstances shift the difficulty of 
the items up or down. 


CoNSISTENCY IN ITEM 
DISCRIMINATION 


Item discrimination, as measured by 
tetrachoric correlations between the pass- 
ing and failing of individual items and the 
success on the test as a whole, turned out 
to be not significantly different at the two 
places. The mean tetrachoric for the 115 
questions at San Jose was .26 with a stand- 
ard deviation of .18, and at Stanford .25, 
standard deviation .15. The product-mo- 
ment coefficient of correlation for the dis- 
crimination turns out to be +.31 which 
indicates that test items that discrimi- 
nated at one place tended to discriminate 
at the other. However, the correlation is 
rather low and suggests that an instructor 
can not assume that because an item is 
very discriminating at one place it will 
invariably be very discriminating at an- 
other. 

There are some problems that arise 
when tetrachoric correlations are used in 
item testing. In general, more difficult 
items yield lower correlations with the 
criterion, because the guessing factor be- 
comes more important. The correction 
proposed by Carroll (1945) is of little help 
when it comes to item selection, for the 
lower reliabilities due to guessing do in 
fact persist in determining the actual test 
scores. Correlations with the total score 
also tend to be reduced for items that de- 
part widely from the average difficulty of 
the items. Because of the correlation in 
item difficulty between the two tests, some 
correlation in discrimination indexes is to 
be expected. 


SuGGEestTions For Locat Test 
CONSTRUCTION FROM ITEMS 
PRETESTED ELSEWHERE 


The comparability of the relative dif- 


ficulty of items at the two places permits 
some suggestions as to the appropriate use 
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at one place of multiple-choice items pre- 
tested elsewhere. While the course under 
consideration was one in psychology, the 
suggestions should apply as well to other 
courses using multiple-choice questions. 

1. Items found satisfactory elsewhere 
are likely to provide a useful pool, but the 
average difficulty level may shift. 

2. Local records of percentage passing 
should be kept, so that item difficulty can 
be controlled according to the nature of 
the students and of the course taught. 

3. Where the pool of questions is large 
enough for appropriate control of diffi- 
culty, it is desirable to narrow the range 
of difficulty of the items. Keeping diffi- 
culty nearly constant will improve both 
the reliability and the validity of the test 
according to Cronbach and Warrington 
(1952) and Lord (1952). 

4. In selecting for difficulty it is better 
to err on the side of choosing questions 
that are too easy than those that are too 
hard. It has been shown by Lord (1952) 
that the optimum level of difficulty is 
somewhat easier than halfway between a 
chance level and 1.00. For a four-choice 
item, this means a level above the mean 
between .25 and 1.00, or something over 
625. By this criterion, the level of diffi- 
culty of the items was about right for the 
Stanford students, but the test was too 
difficult for the San Jose students, result- 
ing in lower reliability. A practical ad- 
vantage of using somewhat easier ques- 
tions is that the resulting distribution of 
scores is likely to be skewed somewhat to 
the lower end, thus, making for good dis- 
crimination at the point where decisions 
about passing and failing have to be made. 

5. Because of the face-validity of edited 
and pretested questions, correlation with 
the criterion can be assumed provided the 
correct answer is given at a level well 
above chance. Hence, the laborious com- 
putation of discrimination indexes is un- 
necessary in selecting questions to remain 
in the local pool of items. Because of mul- 
tiple objectives in many courses, too high 
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a dependence upon internal consistency 
(i.e., correlation with total score) may 
actually defeat some of the objectives. In 
that case, a dependence on face-validity in 
terms of one or more of the course ob- 
jectives, plus a high percentage passing, 
will conserve the good items. 

Because all of these controls can be ex- 
ercised merely through keeping a record of 
item difficulty, those who seriously wish 
to improve measurement in their courses 
are well advised to keep such records on 
the item pools from which their examina- 
tions are constructed. If they do this, they 
may use with advantage items pretested 
elsewhere. 


SUMMARY 


1. Multiple-choice items to be used in 
courses in introductory psychology were 
pretested at San Jose College and at Stan- 
ford University, where the clientele and 
arrangements for the first course differ 
markedly. There were 115 questions in 
common between the two tests, making 
comparisons possible. 

2. Despite a statistically significant dif- 
ference in average difficulty of the items, 
difficulty correlated .80 between the two 
tests. The mean discrimination indexes 
(correlations of individual items with 
total test) did not differ significantly at the 
two centers, and the two sets correlated 
.31 with each other. The results for both 
item difficulty and discrimination suggest 
the usefulness at one place of items pre- 
tested elsewhere. 

3. Because of the importance of item 
difficulty in constructing a satisfactory 
examination, local norms are important. It 
is preferable to keep item difficulty rela- 
tively uniform, and to err on the side of 
having items too easy rather than too 
difficult. 


REFERENCES 


Carro.., J. B. The effects of difficulty and 
chance success on correlations between 
items or between tests. Psychometrika, 

1945, 10, 1-19. 








198 ERNEST R. HILGARD AND EARL L. JANDRON 


CronsacH, L. J.. & Warrinaton, W. G. Ef- 
ficiency of multiple-choice items as a 
function of spread of iiem difficulty. 
Psychometrika, 1952, 17, 127-146. 

Fan, C-T. Jtem analysis table. Princeton, N. 
J.: Educational Testing Service, 1952. 
GuLLIKSEN, H. Theory of mental tests. New 

York: Wiley, 1950. 
Hivearp, E. R., Teevan, R. C., & Wetts, W. 


D. Instructors manual. New York: Har- 
court, Brace, 1954. 

Lorp, F. M. The relation of the reliability of 
multiple-choice tests to the distribution 
of item difficulties. Psychometrika, 1952, 
17, 181-194. 


(Received October 23, 1959) 





Jou 
196 








[ar- 
r of 
ion 
952, 





Journal of Educational Psychology 
1960, Vol. 51, No. 4, 199-207 





THE EFFECTS OF VERBAL MEDIATION ON THE 
MODIFICATION OF CHILDREN’S ATTITUDES! 


SAMUEL H. OSIPOW* 


Syracuse University 


The purpose of the present experiment 
was to investigate the effect of verbal 
mediation on the acquisition of attitudes. 
The general hypothesis was that labels 
and their associated stimulus components 
mediate approach and avoidance re- 
sponses. A secondary hypothesis was that 
initial preferences for stimuli influence the 
rate of learning new affective labels for 
these stimuli. 

From the school’s standpoint, these hy- 
potheses appear to be extremely relevant. 
On the one hand, the teaching-learning 
process is mediated by the use of language 
and other symbols. On the other hand, the 
modification of attitudes is among the 
more important of the school’s objectives. 
Among the many theories concerning the 
development and modification of attitudes, 
the one advanced here attempts to demon- 
strate that language contributes to at- 
titude formation because of the affective 
meanings that are learned to verbal sym- 
bols, thereby relating one feature of the 
teaching-learning process to one of the 
school’s objectives. In order that the 
present study might be still more appli- 
cable to the school situation the subjects 
used were from a population of fourth- 
grade pupils. 

The rationale is based on the theoretical 
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discussion of higher order learning pre- 
sented by Dollard and Miller (1950). La- 
beling, i.e., the connection of verbal re- 
sponses to stimuli, is considered a basic 
process in this theory. The associative 
stimulus components of the label evoke 
response-produced cues leading to the dis- 
tinctive emotional or instrumental re- 
sponses. 

On this basis, preferential behavior is 
assumed to be mediated by a mechanism 
similar to the rg-sg mechanism described 
by Hull (1931). Briefly, a name is learned 
for an unnamed object. This name serves 
as a stimulus for a learned evaluative 
label (Osgood & Suci, 1955), that has ac- 
quired affective associations, i.e., meaning 
(Osgood, Suci & Tannenbaum, 1957). A 
label loaded with positive affective asso- 
ciations mediates an implicit verbal re- 
sponse of approach, resulting in increased 
preference when attached to previously 
neutral objects. Conversely, negatively 
loaded labels mediate avoidance responses 
when associated with previously neutral 
objects, resulting in lower preferences for, 
and avoidance of, the object. 

Following the general laws of learning, 
it is assumed that whatever label is con- 
nected to a stimulus is strengthened with 
each reinforced trial, and will be retained 
as long as it continues to be reinforced. On 
the other hand, an inappropriate label, one 
not reinforced, will be extinguished. Rein- 
forcement will occur if the label provides 
S a useful basis for the discrimination (or 
generalization) of responses, retroactively 
provides primary drive reduction, or in- 
hibits responses which provide a learned 
drive. 

The foregoing rationale implies that new 
verbal labels may be substituted for labels 
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with strong response potential provided 
the new labels are made appropriate by 
reinforcement. Accordingly, rewards pro- 
vided for associating approach labels with 
noxious objects result in the extinction of 
the connection between the avoidance la- 
bel and the object. The avoidance label 
is replaced by an approach label, which, 
in turn, elicits approach responses. The 
process involved in learning to attach 
avoidance labels to initially approach 
stimuli is fundamentally the same. Via 
general transfer principles it is assumed 
that the rate of acquisition of a label for 
a previously unlabeled object is not a 
function of the meaning of the label since 
zero transfer is involved. On the other 
hand, the rate of learning labels for objects 
already possessing names is expected to be 
an inverse function of the amount of dif- 
ference between the meanings of the 
original and new labels since differing de- 
grees of positive and negative transfer are 
involved. 

Several empirical studies in learning 
(Schaeffer & Gerjuoy, 1955; Spiker, 1956; 
Spiker, Gerjuoy, & Shepard, 1956) have 
demonstrated that labels aid in the de- 
velopment of new discriminations or gen- 
eralizations. Parallel investigations (Stag- 
ner & Britton, 1949; Staats & Staats, 
1958; Staats, Staats, & Biggs, 1958; Rhine 
& Silun, 1958) provide evidence that at- 
titudes can be learned by conditioning 
techniques, while Eisman’s (1955) study 
provided support for the application of 
mediated generalization to the study of 
attitudes. 

The above rationale led to the follow- 
ing hypotheses: 

1. The rate of acquisition of labels for 
unlabeled objects is not a function of the 
meaning of the labels. 

2. The rate of acquisition of labels for 
labeled objects is an inverse function of 
the disparity between the meanings of the 
old and new labels. 

3. A change in attitude is mediated by 
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the preference associations stimulated by 


labels. 
METHOD 


Subjects 


The Ss used in the experiment were 25 
male and 31 female fourth-grade pupils 
(mean age 124.6 months, standard devia- 
tion 5.28 months). Seven additional Ss 
were discarded for failure to reach a learn- 
ing criterion discussed below. 


Materials 


Eight sets of wooden blocks were used 
as stimuli in Phase C of the experiment. 
Each set was painted a different color and 
contained one cube, one three-dimensional 
flat triangle, one elongated cylinder, one 
short cylinder, and one parallelogram. 
Nonsense figures were made of black oak- 
tag paper and used as stimuli in Phases B 
and D. The figures were ambiguous in that 
none of the familiar names (triangle, rec- 
tangle, and so on) were appropriate labels. 
A wooden board with three indentations 
was used in Phases B and C to prevent 
the presence of extraneous cues regarding 
the location of a paper token used as re- 
inforcement and placed beneath the object 
identified as “correct.” 


Procedure 


The procedure was divided into four 
phases, as summarized in Table 1. All four 
phases were conducted in one session. 
Phases B, C, and D were modeled after the 
mediational sequence (A-B, B-C, A-C) 
demonstrated (Bugelski & Scharlock, 
1952) to be the most effective in mediating 
responses. 

Which of the Ss was to take part in the 
experiment at any particular time was 
determined by the teacher, who chose the 
next S to see EZ. E then matched S’s name 
to a number (from one of four) selected 
at random to assign the S to a group. The 
last three Ss were assigned to groups lack- 
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ing an S in order to equalize the N in each 
cell of the design. 

Phase A. Four pretests were adminis- 
tered before S participated in the experi- 
ment. Pretest 1 was designed to determine 
whether S knew the meanings of the words 
PRETTY and UGLY, the two “unconditioned 
stimuli.” EZ showed Binet cards, Age IV-6, 
and asked which person on the card was 
prettier and which was uglier. In Pretest 
2, E asked S which of the people depicted 
on the card he would rather be hke in 
order to determine whether S had ap- 
propriate meanings for the words PRETTY 
and uGLy. Pretest 3 was designed to secure 
S’s initial preference ranks for the non- 
sense figures. The figures were placed be- 
fore S, who was instructed to look at the 
pictures carefully and select the one he 
preferred. When the selection had been 
made, E removed the figure, and in- 
structed S to select that one of the remain- 
ing figures he now preferred. This pro- 


cedure (Schroder, 1956) was followed until 
all figures were chosen. A rank order pref- 
erence for ihe nonsense figures for each 
S was thus obtained. Pretest 4 was de- 
signed to determine S’s preference for 
colors. The rank order of preferences for 
eight cubes, each a different color, was ob- 
tained by the same ranking procedure 
used in Pretest 3. 

Phase B. In this phase Ss were trained 
to label the nonsense figures. The general 
method required each S to attach a color 
name to one of his middle-ranked figures. 
The S was instructed that he was about 
to play a game in which he could earn 
tokens that could be traded for a toy. The 
board with three nonsense figures placed 
over the holes was presented to S. Two of 
the figures corresponded to those de- 
manded by the experimental condition to 
which the S was assigned, the third was 
selected at random from the remaining 
figures. S was instructed that: (a) two of 


TABLE 1 


Group Phase A 








I | Pretests: 
1. Knowledge of Pretty 


SuMMARY OF PROCEDURES 


Phase D 





Learn to call 5th (some Ss 4th) | Learn to call most pre- | Re-rank non- 
preferred nonsense figure by 


ferred color PRETTY sense figures. 


and UGLY. most preferred color name, and least preferred 


2. Affective value for 
PRETTY and UGLY. 

3. Rank nonsense fig- 
ures. 

4. Rank colored cubes. 


and 4th (some Ss 5th) pre 
ferred nonsense figure by least 


color UGLY. 


preferred color name. 


II Same as Group I Learn to call 4th (some Ss 5th) | Learn to call 4th (some | Same as Group I 
preferred nonsense figure with Ss 5th) preferred color 
4th (some Ss 5th) preferred | prerry and 5th (some 
color name, and 5th preferred | Ss 4th) preferred color 
figure (some Ss 4th) with 5th UGLY 


(some Ss 4th) preferred color 


name 


Same as Group I. 


III Same as Group I 


Same as Goup IT 


IV Same as Group I 


Learn to call most pre- | Same as Group I 
ferred color UGLY and 
least preferred color | 
PRETTY 


Learn to call 4th (some | Same as Group I 
Ss 5th) preferred Color 
x, and 5th (some Ss 
4th) preferred Color y 
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the figures had color names and one did 
not; (b) he was to determine what these 
names were; and (c) one of the figures 
was named Color 1 and the other figure 
Color 2 (the two appropriate color 
names). EZ then told S that he would be 
asked which figure was named Color 1. 
When he had decided, he was to name the 
figure, point to it, then lift it and look 
beneath it into the indentation. If he 
found a paper token he would know he 
was correct. If he did not find the token, 
then he was wrong, and the figure was not 
named Color 1. £ then instructed S to 
close his eyes, and the procedure was be- 
gun. The selection of a figure for each of 
the two color names was considered one 
trial. The intertrial interval varied, being 
determined by the speed with which E 
set up the next presentation. The pro- 
cedure was followed until S correctly an- 
swered five consecutive trials. After this 
criterion had been reached, 15 additional 
trials were administered. Seven Ss failed 
to reach the criterion after 65 trials and 
were discarded from the study. Two of 
these Ss were in Group I, three in Group 
II, one in Group ITI, and one in Group IV. 

The specific treatment in Phase B re- 
quired Ss in Groups I and III to learn to 
call one middle-ranked figure by the name 
of their most preferred color and the other 
middle-ranked figure by the name of their 
least preferred color. The order was 
determined at random so that some Ss 
learned to call their fourth preferred figure 
by the name of their least preferred color 
and their fifth preferred figure by the name 
of their most preferred color. The pairing 
of figure with color name was reversed for 
the alternate Ss. Groups IT and IV learned 
to call one middle-ranked nonsense figure 
by the name of the other middle-ranked 
color. 

Phase C. After Ss learned color names 
for nonsense figures they were trained to 
learn labels to selected colors of blocks. 
The general method was the same as that 
of Phase B except that the color names 
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were replaced by pretty and vuGLy for 
Groups I, II, and III, and by x and ry 
for Group IV. 

The specific treatments were as follows: 
Ss in Group I named most preferred colors 
PRETTY and least preferred colors UGLY. 
Ss in Group II learned to name one mid- 


dle-ranked color pretty and the other | 


middle-ranked color ucty. Ss in Group 
III learned to name most preferred colors 
uGLY and least preferred colors PRETTY. 
Ss in Group IV, a control group, learned 
to name one middle-ranked color x and 
the other middle-ranked color yr. 

Phase D. After completion of Phase C, 
E informed S that enough tokens had been 
earned for a toy but that one more ac- 
tivity would be required before the prize 
could be chosen. E then presented the 
nonsense figures following the same pro- 
cedure as in Pretest 3 of Phase A. Thus, 
Phase D consisted of a re-ranking by each 
S of his preferences for the nonsense fig- 
ures. 


RESULTS 


The groups did not differ in the number 
of trials to criterion in Phases B and C. 
However, to permit a comparison between 
the rates of acquisition of the labels of 
the groups, the data were Vincentized by 
dividing the total number of trials to reach 
criterion for each S into fifths and com- 
puting the percentage of the number of 
correct trials for each block. 


Phase B 


The data are summarized graphically 
in Fig. 1 for all four groups during Phase 
B. Hypothesis 1 was tested by three spe- 
cific predictions related to the performance 
of the four groups in Phase B. 

Hypothesis 1a. The mean percentage of 
correct responses over blocks of trials will 
show an increase in performance for the 
groups in Phase B. 

Hypothesis 1b. The mean percentage of 
correct responses over blocks of trials will 
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Fic. 1. Vincentized performance curves for Groups I, II, III, and IV during Phase B. 


follow the same pattern for all groups in 
Phase B. 

Hypothesis 1c. The mean percentage of 
correct responses over blocks of trials will 
be the same for all groups in Phase B. 

It was necessary to test Hypothesis la 
to insure that the labels had been attached 
to the nonsense figures by the Ss. The data 
relating to this hypothesis were analyzed 
by a Type I analysis of variance (Lind- 
quist, 1953). The summary of this analysis 
is presented in Table 2. The between trials 
F of 110.27 is significant (p < .001) 
thereby indicating improvement in per- 
formance. 


Hypotheses 1b and le were of interest 
since differences in performance between 
the various groups in the nonsense figure 
labeling task might be related to differ- 
ences in labeling colored objects and/or 
preferences for the figures. The interaction 
of trials by groups shown in Table 2 has 
an F ratio of 0.71, indicating that the dif- 
ferences between the slopes of the curves 
of the groups are small enough to have oc- 
curred by chance. The between groups F 
of 1.49, shown in Table 2, indicates that 
the vertical displacement of the curves 
did not differ in Phase B. Accordingly, 
Hypotheses 1b and le were not rejected. 
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TABLE 2 
ANALYSIS OF VARIANCE OF PHasE B 








Source | a | us| F| » 





Between Groups I, II,| 3 702.37) 1.49) >.05 


III, IV | 
Between Ss in the same | 52 | 471.26) — | — 
grp. | 
Total between Ss | 55 - -|- 


| 
29419. 90/110.90) <.001 


Between Trials I through 4 
Vv 





Interaction: Trials X 12 190.17) 0.71) >.05 
Groups | 
Interaction: Pooled Ss X | 208 266.79) — - 
Trials | 
Total within Ss 224 = _ - 
Total 29 | — | — | - 








The conclusion is that no differences in 
performance existed among the groups 
during Phase B. 


Phase C 


The data for Phase C, in which Ss 
learned to attach the evaluative names to 
the colored blocks, were analyzed by anal- 
ysis of variance. The specific hypotheses 
related to Hypothesis 2 concerned differ- 
ences in performance among the groups 
within this phase. 

The first of these subhypotheses was 
Hypothesis 2a, that the mean percentage 
of correct responses over blocks of trials 
in Phase C will be greatest for Group I, 
less for Group II, and least in Group III. 
The F ratio between groups is 6.25, indi- 
cating a significant (p < .01) difference 
among groups in the percentage of correct 
responses. 

The possibility that the variances of the 
groups may have contributed to this dif- 
ference was tested via Bartlett’s test for 
homogeneity of variance. With 3 degrees 
of freedom and a chi square of 1.52, the 
null hypothesis that the groups comprised 
a population of common variance was not 
rejected. 

Since the analysis of variance indicated 
that the differences among the groups were 
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not likely to have occurred by chance, 
separate comparisons were made by the 
Scheffé test (1953) applied to the Vin- 
centized means between groups for Phase 
C. The results of this analysis are sum- 
marized in Table 4. A significantly higher 
percentage of correct responses were made 
by Group I than by Group II (p < .01) 
or by Group III (p < .05). Groups II 
and III did not differ significantly in the 
mean percentage of correct responses 
(p > .05). 

The second of the subhypotheses was 
Hypothesis 2b, that the mean percentage 
of correct responses over blocks of trials 
would show an increase in performance 





for the groups in Phase C. This hypothesis | 


was of concern since the observation of 
mediation effects in Phase D depends upon 
the association of colors with evaluative 


TABLE 3 
ANALYSIS OF VARIANCE OF PHASE C 











Source | df MS F | p 





2 /12453.05) 6.25) <.01 





Between Groups I, II, 





Ill 
Between Ss in same | 39 | 1991.98) — _ 
group | 
Total between Ss 41 —- j—] - 
Between Trials I through | 4 |27019.54| 41.83, <.001 
V 
Interaction: Trials X | 8 | 1680.71) 2.60) <.05 
Groups | 
Interaction: pooled Ss X | 156 | 645.87) — _ 
Trials | 
Total within Ss | mi-ji-|- 

Total |} 29) — |—| - 

t 
TABLE 4 

CoMPARISON OF THE MEAN NUMBER OF 


Correct RESPONSES OVER’ TRIALS 
DURING PHasE C ror Groups I, 














II, anv III 
Mean Contrast Result ? 
I 82.94/IminusII /|I> II <.01 
II 56.7 II minus III |= Il; — 
III 65.02 | I minus IIT I> Ill <.05 
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Fic. 2. Vincentized performance curves for 
Groups I, II, and III during Phase C. 


names. The between trials F of 41.83 
shown in Table 3 is significant (p < .001) 
and supports the hypothesis. The per- 
formance curves for Groups I, II, and III 
showing the mean percentage of correct 
trials per fifth of trials to criterion for 
Phase C are presented in Fig. 2. 

The third of the subhypotheses in this 
phase was Hypothesis 2c. It was predicted 
that the mean percentage of correct re- 
sponses over blocks of trials would follow 
a different pattern for each group in Phase 
C. In this hypothesis different perform- 
ance curves were predicted for the experi- 
mental groups. The F of 2.60 for the be- 
tween groups over trials interaction shown 
in Table 3 is significant (.01 << p < .05), 
supporting the hypothesis. 


Phase D 


The differences in rank between the non- 
sense figure that had been associated with 
the color called pretty and the figure as- 
sociated with the color called ucLY were 
of interest in Phase D. For Group IV, the 
concern was with the differences in rank 
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between the figures associated with x and 
y. The data relevant to these comparisons 
were analyzed by analysis of variance 
(Type I). Since the pairing of the fourth 
or fifth figure with the positive or nega- 
tive stimulus was made at random, the 
initial rank of the figure was not con- 
sidered in the analysis. The summary of 
the analysis is presented in Table 5. 

The mediation hypothesis (Hypothesis 
3) was tested through two subhypotheses 
by comparisons of the ranks of the figures 
and by comparisons between the ranks 
given by the four groups. 

In Subhypothesis 3a it was predicted 
that the difference between the rank of 
the figure in the PRETTY sequence and the 
UGLY sequence would be significant for 
Groups I, II, and III. The between meth- 
ods F of 9.21 (see Table 5) is significant 
(p < .01). The figures in the sequence in 
which the word PRETTY was used were 
ranked higher in preference than those in 
the sequence in which the word UGLY was 
used. These data support Hypothesis 3a. 

In Subhypothesis 3b, it was predicted 
that the mean difference score of ranked 
preferences between the PRETTY and UGLY 
sequences would be the same for Groups 
I, II, and III, and that Group IV would 


TABLE 5 
ANALYSIS OF VARIANCE OF PHasE D 








| | | 














Source | da | MS | F p 
j | 
Between Groups I, II, 3 | 1.38 | 0.32 | >.05 
III, IV(A) | 
Between Ss in same| 52 4.38 _ _- 
group | | | 
Total between Ss | 55 lm lawl @ 
Between Methods® I and | 1 | 28.00 | 9.21 | <.01 
and I> (B) | 
Interaction—A X B 3 | 0.24 | 0.08 | >.05 
Pooled Ss X B |} §2 3.04 _ - 
Total within Ss 56 - i= — 
| 
Total lan} — | — - 





® Method I refers to association with positive prefer- 
ence. 

> Method II refers to association with negative pref- 
erence. 
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have a smaller difference score than the 
other groups. The between Groups F of 
0.32 shown in Table 5 indicates that no 
differences existed between the changes in 
rankings made by the four groups. Thus, 
Hypothesis 3b is not supported. 

In order to investigate whether x and 
y had positive and negative meanings 
which mediated the respective changes in 
Group IV, a follow-up study was con- 
ducted. All pupils in the classroom were 
given a paired comparison task to rate 
preferences for five letters, among which 
were the letters x and y. An analysis of 
the number of times x and y were chosen 
was performed by a ¢ test for correlated 
data. The results of this analysis indicated 
that both the Ss and the other pupils in 
the classroom expressed more preference 
for y than for x. This finding is the re- 
verse of expectation, since in Phase D the 
figure called by the name of the color 
called x was more preferred while the 
figure called by the name of the color 
called y was less preferred. 


Discussion 


All groups learned to label a nonsense 
figure with a color name at the same rate 
in the initial task. These results are con- 
sistent with the theoretical considerations 
involved in labeling presented earlier in 
this paper. 

The differences in results obtained 
among the experimental groups’ perform- 
ance in attaching evaluative labels to color 
names can be accounted for by consider- 
ing positive transfer effects. The first ex- 
perimental group had learned labels con- 
sistent with its prexperimental attitudes, 
leading to the greatest amount of positive 
transfer, and superiority in performance. 
In the second and third experimental 
groups the labels did not agree, or were 
in lesser agreement, with prexperimental 
attitudes. This resulted in a smaller degree 
of positive transfer, and perhaps some 
negative transfer, from previous experi- 
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ence with the colors. This would account 


for their somewhat poorer performance 
than Group I. The lack of a difference in 
performance between Groups II and III 
indicates no differential effects in these 
groups in the connection of evaluative 
labels to the colored blocks. 

The Ss in the three experimental groups 
changed preferences in the direction pre- 
dicted. Ss ranked the nonsense figures 
higher in preference when they attached 
the evaluative label pretty to the color 
name and lower in preference when they 
attached the evaluative label ucLy to the 
color name. Thus, until the results of the 
control group are considered, the evi- 
dence corresponds closely with that of 
Eisman (1955) and Rhine and Silun 
(1958). The control group, it will be re- 
called, showed final attitudes similar to 
those of the experimental groups. The im- 
portance of the control group is now 
immediately apparent and cannot be over- 
emphasized. When the results of the ex- 
perimental groups are now compared with 
the control group it will be seen that the 
conclusion that meaning is the relevant 
factor in the mediational process must be 
tempered by the fact that similar findings 
occurred with the control group. This 
modified conclusion suggests that media- 
tion may be a more complex phenomenon 
than has been suggested in the earlier lit- 
erature. An accurate examination and 
identification of the relevant variables in 
the mediation of preferences and attitudes 
must necessarily depend upon more ade- 
quate controls than those used in the past. 
On the other hand, one cannot assume that 
the labeling process had no effect on pref- 
erence. The most likely result of such an 
assumption would be no differences be- 
tween the final ranks of the positive and 
negative nonsense figures. However, a dif- 
ference was demonstrated in all groups, 
thus, the mediational process apparently 
has an influence on preference but not one 
specifically attributable to the meanings 
of the unconditioned stimuli. 
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SuMMARY 

The purpose of the present study was 
to investigate the effect of verbal media- 
tion on the acquisition of preferences. Of 
secondary interest was the study of the 
rate of acquisition of labels under various 
transfer conditions. The theoretical basis 
of the study was Dollard and Miller’s de- 
scription of the cue-producing properties 
of labels. 

Fifty-six fourth-grade children were di- 
vided into four groups, each group given 
different treatments. Groups I and III 
first learned to connect their most and 
least preferred color names to nonsense 
figures. Group I then labeled its more pre- 
ferred color name with a positive evalua- 
tive name, while the procedure was re- 
versed for Group III. Groups IT and IV 
labeled nonsense figures with middle- 
ranked color names in the first task. Group 
II then labeled its middle-ranked color 
names with positive and negative evalua- 
tive words, while Group IV labeled its 
middle-ranked color names with x and y. 
Finally, all groups were required to rank 
their preferences for the nonsense figures. 

It was found that the groups did not 
differ in the rate of acquisition of color 
names for the nonsense figures. The rate 
of learning to name colors with evaluative 
words was related to the degree of trans- 
fer involved. The preferences for the non- 
sense figures were related to the positive 
or negative nature of the evaluative word 
in the mediational chain. However, Group 
IV, the control group, showed shifts equiv- 
alent to those of the three experimental 
groups. The results were interpreted as 
only partially supporting the mediation 
hypothesis in the formation of preferences. 
The importance of the control group in 
the demonstration of the role of meaning 
was indicated. 
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SOCIABILITY AND ACADEMIC ACHIEVEMENT IN 
VARIOUS TYPES OF LEARNING SITUATIONS! 


LESLIE R. BEACH 
Whitworth College 


In recent decades, educators have be- 
come increasingly concerned with the prob- 
lem of improving college teaching. As a 
result, a veritable rash of formal and in- 
formal studies has cropped out in an effort 
to find the teaching methods which are the 
most effective in the college classroom. 

Much of the formal research on college 
teaching has centered about differences in 
classroom instructional methods, with aca- 
demic achievement being studied as the 
dependent variable (Birney & McKeachie, 
1955). However, comparatively little at- 
tention has been given to the personality 
of the learner as he performs under the 
various forms of instructional method and 
in varying learning situations. It is this 
problem of matching students to learning 
experiences with which the present study 
is concerned. 

The study was conducted as an adjunct 
to a research project sponsored by the 
Fund for the Advancement of Education.’ 
The personality trait, sociability (or social 
introversion-extroversion), was studied in 
its relationship to academic achievement 
in an advanced educational psychology 
course. In establishing the various types 
of learning situations, an effort was made 
to devise situations representing various 
degrees of student interaction. 

The general hypotheses regarding the 
relationship of sociability to course 
achievement which were investigated in 
this study may be stated as follows: 


*Condensation of a paper presented at 
the joint meeting of the Oregon Psychologi- 
cal Association and the Washington State 
Psychological Association, May 9, 1959. 

*Instructional Efficiency Project at the 
University of Michigan School of Education 
under the directorship of Warren A. 
Ketcham. 


The more sociable student will achieve 
more under conditions permitting maximal 
student interaction (as in interactive dis- 
cussion classes or in small group study) than 
will the less sociable student. 

The less sociable student will achieve 
more under conditions of minimal student 
interaction (as in a lecture section or in 
totally independent study) than will the 
more sociable student. 


PROCEDURE AND SUBJECTS 


The experimental Ss for the study con- 
sisted of the enrollees in the advanced 
educational psychology course, Psychology 
of Child Development, at the University 
of Michigan in the fall of 1956-57. The 
students enrolling in this course were a 
mixture of graduate and upperclass under- 
graduate students—a majority of them 
enrolled in the School of Education. 

Following preliminary testing on course 
achievement and personality variables, the 
students were randomly distributed into 
the four experimental groups on a prede- 
termined proportional basis (the reason 
for proportional distribution being to in- 
sure a smaller number of Ss in the discus- 
sion group and to make the lecture group 
the largest group). Group A, the lecture 
group, contained 36 students. They met 
regularly in traditional classroom fashion. 
Group B, the interactive discussion group, 
contained 13 students. They also met regu- 
larly with the instructor. Group C, the 
autonomous small-groups group, contained 
25 students (later subdivided into small 
groups of five each). They met in small 
groups a minimum of nine times through- 
out the semester and did not have any 
contact with the instructor. Group D, the 
independent study group, contained 24 
students. They studied independently 
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ACHIEVEMENT IN VARIED LEARNING SITUATIONS 


throughout the semester. The total experi- 
mental group numbered 98. 

In order to further establish a baseline 
for comparison of achievement in the ex- 
perimental groups, a control group was 
established. This group was comprised of 
2 students in a sociology course (Mar- 
riage) in the College of Literature, Sci- 
ences and the Arts. They were given the 
same achievement tests as the experimental 
groups at the beginning and end of the 
semester. These students had not taken 
and were not currently taking the educa- 
tional psychology course. Most of them 
were from the three upper classes of the 
Literature, Science, and Arts College, with 
a sprinkling of special and graduate stu- 
dents. 


Measures USED 


In the measurement of course achieve- 
ment, a precourse test was given covering 
the content of the course. This test, com- 
posed by the author of the text, consisted 
of 60 multiple-choice items. An item analy- 
sis performed on these questions the pre- 
ceding year indicated that they discrimi- 
nated adequately among students taking 
the course. The identical test items were 
used as a postcourse test of achievement, 
but were scrambled in their order to over- 
come residual practice effects from pre- 
vious experience with the test. The fact 
that several of the students in the course 
had had some work in education raised the 
lower limit of performance on the pretest 
but there was sufficient spread and ceiling 
on the posttest performance to confirm the 
test’s effectiveness as an achievement 
measure. Achievement in the course was 
measured in terms of gain in scores on 
the posttest over the score earned on pre- 
test. 

The S seale (social introversion-extra- 
version) of Guilford’s Inventory of Fac- 
tors STDCR, used as the measure of 
sociability was selected because it is de- 
signed to assess the action dimension of 
the trait. Other scales such as those de- 
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fining sociability as a “neurotic tendency” 
or “dependency need” were considered less 
suitable because the study was more con- 
cerned with actual propensity toward so- 
cial interaction. 


THe LEARNING SITUATIONS 


At the beginning of the semester all en- 
rollees in the Psychology of Child Develop- 
ment course were tested on a number of 
variables pertinent to both the present 
study and the broader research project 
sponsored by the Fund for the Advance- 
ment of Education. All members of the 
group were assigned to the text, given 
an envelope of study material (including 
supplementary reading materials, a bibli- 
ography of readings, and a Study Guide 
for the course) and then assigned to their 
specific experimental groups. 

Special care was taken to make the class- 
room sections typical lecture and inter- 
active discussion sections, these two groups 
being handled by two different instructors. 
Both instructors find their respective 
methods of teaching the most “natural” 
for them. Each instructor is recognized by 
his colleagues as being skilled in employ- 
ing the techniques peculiar to the teach- 
ing method he employed for the classroom 
section studied in this investigation. 

To provide as adequate a picture as pos- 
sible of what transpired in the various ex- 
perimental groups during the course, a 
description of some typical activities and 
conditions within the groups is helpful. 
These descriptions are based on student re- 
ports and upon observations made on a 
time-sampling basis during the conduct of 
the class sessions. 


Group A, The Lecture 


The procedures used with the lecture 
group were generally quite structured. While 
the class was not a large lecture section, it 
was handled much the same as though it 
were. Occasional questions were raised by 
students but interaction among the students 
themselves was virtually nonexistent. The 
class met in a small auditorium and the 
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formal seating arrangement was conducive 
to instructor-centered proceedings. The lec- 
turer displayed a somewhat informal manner 
but he did plan and direct activities, topics 
and presentations. The role of the student 
was primarily that of listener. This learn- 
ing situation was validated as a traditional 
lecture situation through direct observation 
and evaluation on several criteria gleaned 
from a survey of pertinent research. 


Group B, The Discussion 


The classroom situation for Group B dif- 
fered greatly from that of Group A. The 
class met in a small room with seats ar- 
ranged about a large, low table. The physi- 
cal setting was one generally conducive to 
relaxed informality. Coffee and tea were 
available during the class meeting. The gen- 
eral atmosphere and proceedings of the 
class meeting were definitely informal and 
there was a great deal of interaction among 
group members. The role of the instructor 
was principally that of one of the group. On 
occasion he did help to bring into the 
discussion some group members who were 
reticent in their participation. The role of 
the student in Group B was obviously one 
of active participant in the discussion. The 
group was small (14 including the instruc- 
tor) and there was ample opportunity for 
opinions and ideas to be expressed and 
evaluated by all members of the group 
Some special techniques such as role-playing, 
case studies, buzz groups, etc., were em- 
ployed throughout the semester to enhance 
the value of the discussions and to encourage 
active student participation. This situation, 
also, was validated on the basis of several 
criteria found in the literature to be char- 
acteristic of an interactive classroom dis- 
cussion group. 


Group C, Autonomous Small Groups 


It is more difficult to get a representative 
picture of the procedure followed in the 
last two experimental groups because they 
were out of contact with the instructor and 
established their own procedures for study. 
However, through postmeeting reaction re- 
ports which were turned in by each student 
after his group meeting and also through 
private conversations with group members, 
some information was obtained from the 


members of the autonomous study groups. 
Most of the small groups met more than the 
required minimum number of times. The 
meeting places were usually chosen on the 
basis of comfort or convenience. Typical 
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meeting places were vacant and centrally 
located classrooms on the campus, the stu- 
dent union building, or the home or room of 
one of the group members. Many wel- 
comed the opportunity to share more com- 
pletely the different views and experiences 
represented within their small group and 
there was apparently very active participa- 
tion by all members. As was intended, there 
was virtually no contact with the instructors. 


Group D, Independent Study 


Because of the rare contact with in- 
structors or anyone connected with the 
study, little is known of the procedures fol- 
lowed by the students in Group D. Some did 
mention later that they spread their study 
over the whole semester while others found 
it more profitable to concentrate their study 
at certain points. Some reported that they 
“let the studying slide” until near the end 
of the semester. 


RESULTS 


The precourse measures revealed that 
there were no significant differences from 
group to group on either the sociability 
scale or the achievement test. Checks on 
other factors such as intelligence test 
scores, previous courses in psychology and 
educational psychology, etc., revealed little 
difference in these areas. 

Postcourse measures indicated that 
achievement gains in all experimental 
groups were significant beyond the .001 
level of confidence. As shown in Table 1, 
the mean gain of each experimental group 
was significantly greater (also beyond the 
.001 level of confidence in each case) than 
the mean gain of the control group. A 
small (nonsignificant) gain found in the 
control group was considered attributable 
to the fact that some discussion on child 
development entered into the marriage 
course in the discussions on child rearing 
and family relations. 

Turning to a comparison of the more 
sociable student with the less sociable stu- 
dent, an analysis is here presented in terms 
of the within-groups differences. For pur- 
poses of this analysis, the more sociable 
students in a given experimental group 
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TABLE 1 
ACHIEVEMENT GAIN IN EXPERIMENTAL GROUPS AND ContTrROL Group 





ACHIEVEMENT IN VARIED LEARNING SITUATIONS 











Control Group A Group B Gow t eee 
Group (Lecture (Discussion) Groups) Study) 
Mean Achievement Gain 2.7 10.72 8.31 8.84 9.75 
Sigma 4.7 4.36 3.62 4.48 5.99 
t (compared with control group) — 8.44 4.68 5.63 5.07 
t (net gain in achievement) —_ 8.47 4.72 7.34 4.35 
Note.—All t’s are significant at better than the .001 level of confidence. 
TABLE 2 
SocIABILITY AND ACHIEVEMENT IN THE EXPERIMENTAL GROUPS 
— ——s — _ — ee " — = ~ | — = —— — : oan ———— 
Group A Group B 
Lecture) | (Class§Discussion) | 
More Less | More Less | 
Sociable | Sociable Sociable | Sociable | 
| | | 
High Achievers 6 12 18 | High Achievers 1 5 6 
Low Achievers 12 6 18 Low Achievers 5 1 6 
x? = 2.78 (p < .05) 18 18 36 | x? = 3.00 (p < .05) 6 6 12 
Group C Group D 
(Small Groups) (Independent) 
More Less | More Less 
Sociable | Sociable Sociable | Sociable 
High Achievers 10 2 12 | High Achievers 6 6 12 
Low Achievers 2 10 12 | Low Achievers 6 6 12 
12 12 24 2 12 24 


x = 8.17 (p < .01) 


group on the sociability measure and the 
less sociable students in a given experi- 
mental group were those falling below the 
median of that group on the sociability 
measure. This dichotomizing procedure 
was followed with achievement scores also. 
The members of each experimental group 
can then be tabulated in a two by two 
table in terms of relative sociability and 
relative achievement. The chi square test 
of significance (one-tailed), with correction 
for continuity, was applied for each group. 

Table 2 shows the results of these tabu- 
lations in two by two tables for the four 
experimental groups. 

When the chi square test was applied to 
these observed differences in achievement 
between the more sociable and less so- 











x? = 0.17 (ns) 1 


ciable students, significant differences in 
the predicted direction were found in 
Group A and Group C. In Group A, a 
difference in achievement was found (p 
less than .05) in favor of the less sociable 
student. In Group C, a very significant dif- 
ference was found (p less than .01) favor- 
ing the more sociable student. Thus, in 
these two experimental groups the general 
hypotheses of the study find support. 

The results in Group B (the classroom 
discussion group) are quite puzzling. The 
difference here is in the direction opposite 
to that predicted. Discussions with the in- 
structor of this group provided few leads 
as to how the results might possibly be ac- 
counted for. The most promising hypothe- 
sis which might be worthy of further in- 
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vestigation is that the presence of the 
instructor plays a significant role in de- 
termining participation and the amount 
or the kind of participation by the more 
sociable student may vary with the in- 
structor’s presence or absence. The same 
may be true of the less sociable student. It 
is also possible that if other “course out- 
comes” had been evaluated in addition to 
achievement gain in course content, this 
type of result may not have appeared uni- 
formly in Group B. 

No differences in achievement were 
found between the more or less sociable 
students under conditions of completely 
independent study. 


SUMMARY 


In studying the relationship of sociabil- 
ity to academic achievement in a course in 
advanced educational psychology, four ex- 
perimental groups were established. These 
groups, characterized by varying degrees 
of student interaction and student-teacher 
contact, were: A (a lecture section), B 
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(an interactive discussion class), C (a 
group of small instructorless groups of 
five members each), and D (an independ- 
ent study group). In learning situations 
A and C a significant relationship was 
found between sociability and achievement 
as hypothesized, i.e., the less sociable stu- 
dent achieved more than the more sociable 
student in the lecture section and in the 
small-groups group the more sociable stu- 
dent achieved more than the less sociable 
student. It appears that in situations simi- 
lar to the course here described the § 
(social introversion-extraversion) scale of 
the Guilford Inventory of Factors STDCR 
may have some value in identifying those 
individuals who will perform well in these 
two types of learning situations. 
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VARIATION OF WHOLE-PART METHODS OF LEARNING 


F. J. McGUIGAN 
Hollins College 


McGuigan and MacCaslin (1955) in 
comparing the effectiveness of a part, an 
incomplete-whole and a whole method of 
learning found that the whole method was 
superior to the incomplete-whole method 
which in turn was superior to the part 
method. By holding the kind of practice 
(massed or distributed) of the culminating 
aspect of a task constant their comparisons 
led to the conclusion that the whole method 
is superior to the part method largely be- 
cause the whole method involves distribu- 
ted practice. An extension of their reason- 
ing led to the hypothesis tested in this 
study: within limits, the larger the unit 
practiced (and hence the greater the dis- 
tribution of practice on the various aspects 
of the task) the better the performance. 

A secondary purpose was to study the 
effect of varying experimenters. Psycholo- 
gists tacitly assume that their results may 
be generalized to a population of Es, but 
there has been relatively little sampling of 
such a population to justify the general- 
ization.” For if interactions between experi- 
menters and treatments exist, the extent to 
which experimental results may be general- 
ized must be sharply restricted. 


METHOD 


Subjects. Seventy-two undergraduate 
women at Hollins College were used as Ss. 
Nine Es who were women undergraduate 
majors in psychology were trained in the 
experimental procedure. The Ss were ran- 
domly assigned to four groups and each E 
was randomly assigned to run two Ss in 
each group. 

Procedure. The task for the Ss was to re- 


*The necessity for sampling populations 
that are usually held constant in an experi- 
ment (such as a population of Zs) has been 
pointed out by Hammond (1954) and Bruns- 
wik (1956). 
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produce card Design Number XVII of the 
Kohs Block Design Test (C. H. Stoelting 
Co., Chicago, Illinois). The four groups 
were randomly assigned to four conditions: 
(a) part method, (6) very-incomplete- 
whole - method, (c) incomplete - whole - 
method, and (d) whole method. The card 
design was divided into four quadrants. 
The various quadrants were covered with 
pieces of black paper as appropriate for 
each condition. The procedures for the four 
groups were: 


Group P (Part Method). Ss for the part 
method were given four blocks and allowed 
to look at only the first quadrant of the de- 
sign, the remaining three quadrants not be- 
ing exposed to them. Then they reproduced 
the first quadrant with the blocks 12 times 
in succession. Following this the second quad- 
rant was exposed while the other three quad- 
rants were hidden from view. They then re- 
produced the second quadrant 12 times. In 
similar manner the third and fourth quad- 
rants were reproduced 12 times apiece. 

Group VIW_ (Very-Incomplete-Whole- 
Method). The Ss in this group were given 
eight blocks and allowed to look at half of 
the design, vertically divided. They were in- 
structed to reproduce the half of the design 
that they saw 12 times. Following this the 
first half of the design was covered and the 
second half exposed. The Ss then reproduced 
the second half of the design 12 times. 

Group IW (Incomplete-Whole-M ethod) 
The design was divided into two unequal 
parts for these Ss. First the first, second, and 
third quadrants were simultaneously exposed 
while the fourth quadrant was covered. The 
Ss reproduced these three quadrants as a unit 
12 times. Then the first three quadrants were 
covered while the fourth quadrant was ex- 
posed. The Ss then reproduced the fourth 
quadrant 12 times. 

Group W (Whole Method). No part of the 
design was ever covered for the whole method 
Ss. Hence, they reproduced the design 12 
times as a total unit. 


All Ss were allowed to complete the de- 
sign, or portions thereof, at their own pace. 
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Following completion of the above training 
all Ss were presented the 16 blocks and 
were allowed to look at the entire design. 
They were then instructed to reproduce the 
design as rapidly as possible. The time re- 
quired for each S to complete the design on 
this test trial was recorded. 

Using four methods and nine Es, it can 
be seen that a 4 X 9 factorial design was 
employed. The methods constitute a fixed 
variable while Es may be considered to be 
a random variable. Hence the analysis is 
that for a mixed model. 


RESULTS 


The results of the analysis of variance 
are presented in Table 1 where it can be 
seen that the only significant F is for meth- 
ods. The mean times on the test trial for 
Groups W, IW, VIW, and P were 65.85, 
93.87, 167.87 and 166.53 seconds respec- 
tively. With the exception of the reversal 
for Group VIW and Group P, the order of 
means is that hypothesized. Appropriate 
testing indicated no basis for assuming that 
the variances were other than homogene- 
ous. Duncan’s Range Test (Duncan, 1955) 
was applied to determine significant differ- 
ences among the various pairs of means. It 
was found that Groups W and IW were 
both superior to Groups VIW and P at the 
5% level. No other differences were signifi- 
cant. It may thus be concluded that the 
methods that allowed the larger practice 
units (the whole method and the incom- 
plete-whole-method) were superior to those 
that permitted practice on the smaller units 


TABLE 1 


ANALYSIS OF VARIANCE FOR TIME ScorREs 
on Test TRIAL 





| 
Source of Variation | d/ MS F 








Between Methods 3 | 48 
Bi 


, 123.38 | 6.01* 

Between Experi- 2,539.12 | .32 
menters 

MXE 24 | 8,012.26 .93 
Error 36 | 8,580.59 | 





* Significant beyond 1% level. 
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(the very-incomplete-whole-method and 
the part method). 

The lack of a significant interaction be- 
tween methods and Es increases our con- 
fidence in generalizing to a population of 
Es. However, since the population of Es 
sampled was rather restricted, generaliza- 
tion to a larger or somewhat different popu- 
lation must be made with caution. 


DIscussION 


The present status of research on the 
whole-part problem is well summarized by 
Underwood (1949): 


Results have been contradictory, some- 
times showing facility in learning by wholes 
and sometimes by parts, and psychologists 
have been unable to specify adequately the 
nature of the differences between experi- 
ments which would account for discrepancies. 
The reason for this is probably that no com- 
prehensive hypothesis concerning the vari- 
able has been set up by which the success of 
the studies may be evaluated ...there is a 
lack of adequate theory to direct experimen- 
tation (p. 418). 

However, a review of the relevant litera- 
ture indicates few results that would be ac- 
cepted by present day scientific standards. 
For frequently n’s have been extremely 
small, appropriate tests of significance have 
not been applied, confounding of variables 
has been present, ete. The sounder research 
might lead us to conclude that the whole 
method is generally the superior method. 
The large number of variables that are rel- 
evant to the whole-part problem, however, 
would certainly leave room for exceptions 
(cf. Crafts, 1930). In an effort to bring 
order to the whole-part problem, as Un- 
derwood suggests is necessary for further 
research to be fruitful, the following the- 
oretical basis is offered as at least a partial 
explanation for the apparent superiority of 
the whole method. 

The results of the present investigation 
are consonant with the previously ad- 
vanced hypothesis that much of the su- 
periority of the whole method is due to the 
fact that it entails distributed practice. To 
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illustrate this conclusion by means of the 
present task, let us say that R, is the re- 
sponse of placing the blocks together to 
form the first quadrant of the design, R, 
is the response for constructing the second 
quadrant, etc. Now, when the part method 
is used, R, is made repeatedly, then R, is 
made repeatedly, etc. Hence, practice on 
R, , R, , ete. may be said to be massed. The 
whole method, on the other hand, entails 
distributed practice on the four parts of 
the task. Thus, R, is made once followed by 
R,, then R, and finally R,, or whatever 
order S selects. It can thus be seen, from 
this point of view, that the whole-part 
problem becomes one concerned with mass- 
ing vs. distribution of practice. And Hull 
(1951) has offered us a reasonable explana- 
tion of why distributed practice is superior 
to massed practice. Applying his theory to 
the whole-part problem, using the present 
task as an example, we may say that when 
the part method is used, the reactive inhi- 
bition entailed by continuous performance 
of R, is sizeable (Hull’s Postulate IXA). 
However, when using the whole method, 
R, is followed immediately by R, . There- 
fore, the reactive inhibition accumulated 
by performing R, with the whole method is 
rather small; in particular it is smaller 
than when R, occurs repeatedly as with the 
part method. And so on for the remaining 
responses. Following Hull, the greater the 
amount of reactive inhibition present at 
any given time, the greater the amount of 
conditioned inhibition (Hull’s Corollary 
IX). Since with the part method, practice 
on every response is massed (R, for in- 
stance), the amount of reactive inhibition 
present at the end of training is greater 
than for the whole method (where with 
distributed practice on R, for instance, re- 
active inhibition dissipates to a greater ex- 
tent). Therefore, the amount of conditioned 
inhibition attached to any response would 
be greater when the part method is used 
than for the whole method. This means at 
any given time (either immediately after 
the task has been finished, or after consid- 
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erable time when reactive inhibition has 
dissipated) the inhibitory potential for R, , 
R,, Rs, and R, will be greater when the 
part method has been used than when the 
whole method has been used. For this 
reason, other factors being equal, we should 
expect the whole method to be superior to 
the part method. Of course this extension 
of Hull’s theory must be checked by fur- 
ther test, and since other factors are not 
always equal, we should expect some ex- 
ceptions. Further research should enable 
us to better specify the effect of these other 
relevant variables. 

The above reasoning was illustrated by 
using the extreme methods employed in the 
present study, the whole and part methods. 
A general application of the extension of 
Hull’s theory to all groups would be as 
follows. First let us assume the degree of 
distribution of practice for the four groups 
in increasing order is: Group P, Group 
VIW, Group IW, and Group W. That is, 
Group P had the most massed practice on 
each response while Group W had the most 
distributed practice on each response. 
Therefore Group P should have generated 
the largest inhibitory potential for each re- 
sponse, Group VIW the next largest, Group 
IW the next, and Group W the least. There- 
fore, it should be expected that Group P 
would perform the poorest, Group VIW the 
next poorest, Group IW the next, and 
Group W should exhibit the best perform- 
ance. The results were in general agree- 
ment with this expectation, 

It should be pointed out that the above 
theorizing rests on the assumption that the 
inhibitory potential generated by a given 


* As with any known experiment, alterna- 
tive explanations of the present data may be 
expected. That a theory postulating that the 
amount of “integration” of the design is not 
appropriate is suggested by work on this 
variable by Seagoe (1936). For where she 
varied the “integration” of the design repro- 
duced the only difference between whole and 
part methods that even approached signifi- 
cance were where the whole method was su- 
perior. 
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response is uniquely attached to that re- 
sponse. That is, except for possible gener- 
alization of inhibitory potential among re- 
sponses, the amount of inhibitory potential 
relevant to each response is that generated 
by it and by no other response. This as- 
sumption is crucial to the above explana- 
tion because if the reactive inhibition gen- 
erated by R, is also attached to R, , R; , and 
R, then continuous practice by the whole 
method would result in the same inhibitory 
potential as by the part method. 


SUMMARY 


This study sought to test the hypothesis 
that the larger the unit practiced, the bet- 
ter the performance; and also to study the 
effect of varying experimenters. Four meth- 
ods which varied the extent to which part 
or whole learning was possible were used. 
Nine Zs ran 72 Ss, a total of 8 Ss each. The 
results for methods showed, in general, that 
the larger the practice unit, the better the 
performance and were thus in accord with 
the previously advanced hypothesis that 
much of the superiority of the whole 
method may be due to the fact that it en- 
tails distributed practice. Hull’s theory of 
inhibition was extended to provide a the- 
oretical basis for explaining the superiority 
of the whole method over the part method. 


The results also showed that variation of 
Es did not lead to an interaction with 
methods. This finding increases our confi- 
dence in generalizing results for methods to 

a hypothetical population of experimenters, 

though more thorough sampling of such a 

population should occur in future research. 
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THE EFFECTS OF VISUAL DISCRIMINATION PRETRAIN- 
ING ON LEARNING TO READ A VOCABULARY LIST 
IN KINDERGARTEN CHILDREN'! 


SIEGMAR MUEHL 
Iowa Child Welfare Research Station 


To master beginning reading, a child 
must be able to associate word names, or 
meanings, with their printed symbols. To 
make these associations a beginning reader 
must be able to see and respond to the 
printed symbols themselves as distinctive 
and stable stimulus compounds. 

How does this visual discrimination skill 
develop? Theories of perceptual develop- 
ment attempt to relate sensory input to 
the learning process. The perceptual theo- 
ries developed by such writers as Hebb 
(1940), Gibson (1940), and Murphy 
(1951) stress the contribution of sensory 
experience to perceptual differentiation of 
a stimulus field. Thus, for beginning read- 
ers, the question can be asked: What types 
of sensory experiences facilitate visual dis- 
crimination of complex stimuli represented 
by printed words in a reading task? The 
answer has practical implication for visual 
discrimination training in the reading 
readiness program. McKee (1948, p. 147) 
has noted that commercial reading readi- 
ness books concentrate on visual discrimi- 
nation exercises using geometric or animal 
forms rather than printed words. Educa- 
tional research (Goins, 1958) has never 
confirmed the effectiveness of this type of 
discrimination training for beginning read- 
ing. 

Although not directly concerned with 
the learning process defined in terms of 
reading performance, psychological studies 
in the area of discrimination pretraining 
provide hypotheses, research procedures, 


*The author wishes to thank Buford W. 
Garner, Superintendent of Schools in Iowa 
City, John Moore and Richard Hovet, Prin- 
cipals, and Lois McDermott and Nancy 
Cooker, kindergarten teachers, whose co- 
operation made this research possible. 


and findings which may contribute to un- 
derstanding the relation between visual 
discrimination skills and reading perform- 
ance. Studies in this area have investigated 
the effects of different types of stimulus 
discrimination pretraining on performance 
in a subsquent discrimination learning 
task. These studies provided tests of either 
Miller’s (1948) acquired distinctiveness of 
cues hypothesis, or of Gibson’s (1940) 
transfer of discrimination hypothesis. Both 
hypotheses state, in effect, that if differ- 
ences are established among similar stimu- 
lus items or situations, then learning will 
be facilitated in a subsequent task using 
these same stimuli. An analysis of this re- 
search by Spiker (1956), and the later 
findings of Kurtz (1955), indicated that 
stimulus differentiation pretraining does 
result in positive transfer to a second task 
employing the same visual stimuli but dif- 
ferent responses as used in the pretraining 
task. 

The general objective of the present 
study was to extend the experimental pro- 
cedures used in the stimulus pretraining 
research to investigate the question raised 
above: What types of sensory experiences 
facilitate visual discrimination among 
printed words occurring in a reading task? 
Specifically, the purpose was to investigate 
the effects of different types of visual dis- 
crimination pretraining on the perform- 
ance of kindergarten children in learning 
to read a vocabulary list. Prior to learn- 
ing the same vocabulary list, kindergarten 
children were given matching pretraining 
with three types of stimulus material: the 
same words as those appearing in the vo- 
cabulary list; different words; and geo- 
metric forms. Based on the results of the 
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previous experimental work in stimulus 
pretraining, the prediction was made that 
the group given visual discrimination pre- 
training matching the same words would 
learn the vocabulary list faster than the 
groups given pretraining with different 
words or geometric forms. 


METHOD 


Pretraining 


Three different sets of stimuli were used 
in the matching pretraining, with four 
stimuli in each of the sets. There were two 
random orders for each set of four stimuli. 
Set S (same words) consisted of four 
words: BOAT, HELP, PLAY, COME. Set D 
(different words) constisted of four words: 
MAKE, JUMP, WORK, FIND. Set F (control) 
consisted of four geometric forms. Differ- 
ences in these forms were obtained by 
varying a pentagonal figure with a single 
line connecting different inside corners. 
Set S and D stimulus words were typed 
in lower case with a primary style type- 
writer. Set F stimulus forms were drawn 
with ink and measured 144” on the base 
and in height. The individual stimuli in 
all sets appeared on a 244” X 234” white 
card. Accompanying each order of stimuli 
in each set were two, 2%” x 82”, mul- 
tiple-choice response cards. The four stim- 
uli appearing on the stimulus cards of the 
set also appeared in two different arrange- 
ments on the two response cards. 

The apparatus for presenting the stimu- 
lus and response cards consisted of a slop- 
ing metal surface, 842” x 9’. Close to the 
top of this surface and approximately at 
the eyelevel of the S was a small ledge for 
presenting the stimulus cards. The bottom 
three inches of the metal surface con- 
sisted of a hinged window which could be 
raised by the S. A slot behind the window 
held a response card. The latter could be 
placed in position from behind without 
the Ss seeing it. Directly in front of the 
response card window was a flat, 3” x 9” 
panel containing four buttons. When the 


response card window was open, these but- 
tons were directly in front of each of the 
four stimuli on the response card. The 
buttons were placed in a circuit with a se- 
lector switch and chime located behind the 
apparatus. The switch enabled the E to 
select any one of the four response buttons. 
When the correct button was pressed by 
S, the chime sounded. 

Three groups of Ss were differentiated 
with respect to their pretraining experi- 
ence. Ss in Group S were given six trials 
with the four Set S stimuli. Ss in Group 
D were given six trials with the four Set 
D stimuli. Ss in Group F were given six 
trials with the four Set F stimuli. A presen- 
tation for all groups consisted of E’s plac- 
ing a stimulus card in the card holder for 
approximately two sec. S was told to 
look carefully at the card. At the end of 
this interval, E removed the stimulus card 
and S was instructed to open the response 
card window, find the same stimulus on 
the response card that had appeared on the 
stimulus card, and press the button di- 
rectly under this response stimulus. The $ 
had approximately five sec. in which to 
press the button. If S pushed the correct 
button the chime would ring. If S pushed 
the wrong button, or if S failed to respond 
in the approximately five-sec. interval, E 
closed the response card window and a new 
stimulus card was presented. E changed 
the response card after each order of stim- 
ulus cards had been presented. No rewards 
were given during pretraining other than 
verbal encouragement from £ and the re- 
inforcement provided by the chime. 


Learning Task 


Immediately following the pretraining 
matching task, the vocabulary learning 
task was administered. All Ss received 12 
trials with the four stimuli of the Group 
S set. The Ss were instructed that they 
were to try to learn what the words said. 
On the first trial EZ presented each of the 
four stimuli and told S what each said. 
On the following trials, S was instructed to 
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try to beat E in saying the word. Follow- 
ing the first trial E used a paired-associate 
method of presenting the stimulus and re- 
sponse elements. E presented the stimulus 
card. S was given an approximately two- 
sec. anticipation interval in which to say 
the word. At the end of this interval Z 
gave the correct response, with the stim- 
ulus card still present to the S. No rewards 
were given during the learning task other 
than verbal encouragement from £ and 
the reinforcement provided by S’s making 
a correct anticipation. Upon completion of 
the learning task, Z asked the Ss in Groups 
S and D to tell how they matched the 
words in the pretraining task. 


Subjects 


The Ss were 37 children from three 
kindergartens in the public schools in Iowa 
City. Ss were randomly assigned to the 
three pretraining groups. There were 13 
Ss in Group S; 12 Ss in Group D; and 12 
Ss in Group F. Ages ranged from 5-1 to 
6-2. The mean age was 5-6. There were 
no significant age differences among the 
three pretraining groups. Testing was 
done in December. 


RESULTS 


Pretraining 


The means and standard deviations of 
correct responses in the six pretraining 
trials are given in Table 1. The mean 
number of correct responses for the three 
pretraining groups did not differ signifi- 
cantly among the groups. (Lindquist 
[1953] Type I analysis, p > .05). The mean 
number of correct responses over trials 
for the combined groups did differ sig- 


TABLE 1 
NuMBER OF CorRECT RESPONSES IN SIX 
PRETRAINING TRIALS 





Group S Group D Group F 





Mean 18.23 17.75 17.08 


SD 3.55 3.70 1.93 
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BLOCKS OF TWO TRIALS 


Fic. 1. The effect of type of matching pre- 
training on correct responses in learning the 
four word vocabulary list. Eight responses 
were possible in each block of two trials. 


TABLE 2 
ANALYSIS OF VARIANCE OF CORRECT 
RESPONSES IN 12 LEARNING 
TRIALS 
(in Blocks of Two Trials) 


dj | MS FP p 





Source 








Between-Subjects | 36 
9 


Groups (B) 25.88) 3.74) <.05 

Error (b) | 34 6.91 

Within-Subjects | 185 

Trials (A) | §| 28.66) 24.07) <.01 

AXB | 10) 1.41) 1.18) ns 

Error (w) 170} 1.19) 
Total 221) | 





nificantly (p < .01). An analysis of the 
correct response trend over trials indicated 
that discrimination learning occurred dur- 
ing pretraining. 


Learning Task 


The learning curves in the vocabulary 
task for each of the pretraining groups are 
presented in Fig. 1. A summary of the 
Lindquist (1953) Type I analysis of vari- 
ance for these data is given in Table 2. 
As shown in the summary, type of match- 
ing pretraining significantly affected learn- 
ing performance (p < .05). The interaction 
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of type of pretraining with trials was 
not significant (p > .05). When Groups 
S and D, and §S and F were compared in 
learning performance the differences were 
significant (p < .05). Comparing Groups 
D and F, the difference in learning per- 
formance was not significant (p > .05). 


Discussion 

The prediction was made that Ss given 
visual discrimination pretraining match- 
ing the same words to appear in a vocabu- 
lary list would learn this faster than Ss 
given matching pretraining with either 
different words or geometric forms. The 
results supported this prediction. Since 
there was no no-discrimination pretraining 
control group, it was not possible to de- 
termine whether positive transfer resulted 
from the different word and geometric 
form pretraining. 

Ss’ reports on how they matched the 
words, and an analysis of the type of 
intraset errors made during pretraining, 
provided some evidence to account for 
these results. Single letters or details of 
these letters appeared to be the basis for 
word matching. The characteristics that 
made a letter stand out visually were posi- 
tion within the word, most often first and 
last letters, and unique shape. This evi- 
dence suggests that the positive transfer to 
the learning task in Group S was due to 
the acquisition, during pretraining, of a 
set of observing responses to distinctive 
letters in the stimulus words. These 
learned responses served to make the 
words in the vocabulary list more dis- 
tinctive at the beginning of learning for 
Group S than for Groups D and F. It is 
not possible to characterize the precise 
nature of these observing responses from 
the present data. With verbal Ss it is 
likely that these responses are mediated by 
implicit verbal responses. In support of 
this latter possibility, a few Ss reported 
they matched the words by looking at first 
letters, or by a letter which was in their 
name. There was no indication, however, 
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that the majority of Ss knew, or were 
using letter names as a basis for medi- 
ating the matching process. 

In comparing Group D with Group F, 
the evidence showed that the acquisition 
of a set of observing responses suitable 
to differentiate one list of words did not 
significantly transfer, either negatively or 
positively, to the task of acquiring another 
set of observing responses with a different 
list of words. 


SuMMARY 


This experiment compared the per- 
formance of three groups of kindergarten 
children in learning to read a vocabulary 
list. The groups differed in their pre- 
training experience. Group S learned to 
match the same words that appeared in 
the vocabulary list; Group D matched 
different words; and Group F matched 
geometric forms. 

The response measure was the number 
of correct anticipations in 12 trials of the 
vocabulary learning list. Analysis of the 
data showed that Group S performed sig- 
nificantly better than Groups D and F. 
The difference between Groups D and F 
was not significant. 

The results suggest that word recog- 
nition skill is related to the development 
of specific visual discrimination skills. The 
results also agreed with the findings of 
previous psychological research in stimu- 
lus differentiation pretraining. 
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COLLEGE PRESS AND CHANGES IN STUDY PLANS 
OF TALENTED STUDENTS 


DONALD L. THISTLETHWAITE 
National Merit Scholarship Corporation 


Despite its importance, the concept of 
“recruiting” able persons for the intel- 
lectual disciplines will seem alien to some 
readers. The notion of proselytising vio- 
lates their self-concepts. At best, persua- 
sion seems an ineffective means of attract- 
ing promising young scholars and 
scientists into higher educational pro- 
grams. A broader use of “recruitment” is 
considered here: more generally, the proc- 
ess of attracting talented undergraduates 
into different careers may be seen to con- 
sist—not in persuading them to do things 
they do not wish to do, but rather—in 
helping them to discover how they can 
best realize their creative and productive 
potentialities. 

In the present investigation, changes 
in major field of study and in the level of 
training sought, were related to the col- 
lege experiences of winners and near- 
winners in a national scholarship com- 
petition. Information was obtained on the 
relative “holding power” and attractive- 
ness of different fields of study, the char- 
acteristics of students and faculty in these 
fields, the effects of intellectual and social 
atmospheres upon motivation to seek ad- 
vanced training, and the traits of instruc- 
tors which had the greatest influence 
upon these talented students. An under- 
standing of the social systems influencing 
students to specialize in certain fields 
and to seek graduate or professional train- 
ing might enable us to create college 
environments in which more students 
could develop their talents.’ 

*This research was partially supported 
by the National Science Foundation, the 
Old Dominion Foundation, and the Na- 
tional Merit Scholarship Corporation. The 
author is indebted to Laura Kent and to 
John L. Holland for a critical reading of 
the earlier version. 


PROCEDURE 


Sample. In May, 1959, 546 winners 
(Merit Scholars) and 1,480 near-winners 
(finalists who received Certificates of 
Merit) in the 1956 National Merit Schol- 
arship competition were surveyed by mail. 


——— 


The survey was limited to students who 
| 


had responded to previous surveys and 
who were enrolled in college. Of those to 
whom questionnaires were sent, 117 were 
considered ineligible for inclusion in the 
study. (This group consisted of 40 who 
returned incomplete questionnaires, 44 
who had withdrawn from college, 12 who 
had transferred to a new college, 17 who 
had moved and left no forwarding address, 
3 who had entered religious seminaries, 
and one who died.) Returns were received 
from 1500 of the eligible students: 469 
Scholars (92% returns), and 1,031 C of 
M winners (75% returns). 

All of the 1500 respondents took the 
College Entrance Examination Board tests 
during their senior year in high school 
(1956) ; 99% had SAT verbal scores and 
84% had SAT mathematical scores which 
placed them above the 95th percentile 
relative to the nation’s high school seniors 
(Levine, 1958). Two-thirds of the group 
had verbal scores and half had mathe- 
matical scores which placed them above 
the 95th percentile relative to typical 
College Board examinees. The median 
SAT verbal score for the group was 685, 
and the median SAT mathematical score 
was 691. 

The sample—987 men and 513 women 
—includes students from 327 different 
American colleges and universities. 

Changes in study plans. Information 
about changes in the direction of study 
plans was elicited using a list of 61 sub- 
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ject-fields, identical to those used by Iffert 
(1957). The student was instructed to: 


Write the number (1) before the sub- 
ject in the list below which corresponds to 
your present major field of study. If your 
major interest has changed since you en- 
tered college write the number (2) before 
the subject in which you planned to major 
when you entered college. 


To estimate the impact of undergradu- 
ate experiences upon plans to seek ad- 
vanced training each student was asked, 
“Has the level of training you aspire to 
achieve changed since you entered col- 
lege?” The three fixed alternative re- 
sponses to this question are shown in 
Table 3. 

Classification of fields of study. The stu- 
dent’s field of concentration was classified 
into one of five broad areas: natural 
sciences (astronomy, chemistry, engineer- 
ing geology, mathematics, metallurgy, 
physies) ; biological sciences (agriculture, 
bacteriology, biochemistry, biology, bot- 
any, nursing, physiology, predentistry, 
premedicine, zoology) ; social sciences (an- 
thropology, economics, education, geog- 
raphy, history, international relations, po- 
litical science, psychology, social work, 
sociology) ; arts and humanities (architec- 
ture, English, fine arts, language, music, 
philosophy, speech, dramatic arts); and 
unclassified fields (accounting, business ad- 
ministration, home economics, journalism, 
prelaw, religious education, and other spe- 
cialties not elsewhere classified). Within 
each of these broad areas the most popular 
fields of study—at the end of the third year 
of college—were as follows: engineering 
(35% of the students in natural sciences) ; 
biology (24% of the students in biological 
sciences); history (35% of the students 
in social sciences); and English (49% of 
the students in the arts and humanities). 

Faculty and student press scales. 


Thistlethwaite (1959a, 1959b) has shown 
that variations between colleges in the pro- 
portions of their alumni who subsequently 
earned the PhD are related to differences 
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in intellectual and social atmospheres. In 
the earlier study, selected items from the 
College Characteristics Index (Pace & 
Stern, 1958; Stern, 1958) had _ been 
grouped on the basis of similarity of con- 
tent, taking care to make each item- 
cluster descriptive of either faculty or stu- 
dents but not both; items were then 
discarded which (a) did not differentiate 
between “high” and “low” productivity 
colleges, or (6) had low correlations with 
total provisional scores of items with 
which they were clustered on the basis of 
content. For the purposes of the present 
study, additional items were written for 
each cluster and each item-cluster edited 
so that each scale contained 10 items, five 
keyed so that a True response is weighted 
positively and five keyed so that a False 
response is weighted positively. Scale re- 
liabilities (Kuder-Richardson Formula 
20), sample items, and responses weighted 
positively for each of the items are shown 
in Table 12 In answering faculty press 
items, students were instructed to describe 
the facilities, faculty, and requirements in 
their present major field of study; in 
answering student press items, they were 
instructed to describe the activities and 
interests of students in their living group 
and classrooms. 


REsvuLTs 


Changes in Study Plans 


Major fields of study. During the three 
years 1956-59 the proportions of boys and 
girls majoring in the natural and biological 
sciences decreased while the proportions 
majoring in the social sciences and human- 
ities increased (Table 2). Although the 


*Complete lists of items in each scale, 
and keys used in scoring items have been 
deposited with the American Documenta- 
tion Institute. Order Document No. 6328, 
remitting in advance $1.75 for microfilm 
or $2.50 for photocopies. I am indebted to 
C. Robert Pace and George G. Stern for 
permission to incorporate items from their 
College Characteristics Index in these scales. 
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TABLE 1 


RELIABILITIES AND REPRESENTATIVE ITEMS OF FacuLTY AND STUDENT 


Press 


Press ScALES 








Scale 


_ Most Representative Item 
(Highest item-total score correlation) 





Faculty scales: 


Scale Reliability 


Achievement Standards set by the professors are not particularly .73 
hard to achieve. (F) 

Affiliation The professors really talk with the students, not just .70 
at them. (T) 

Compliance Frequent tests are given in most courses. (T) .57 

Directiveness Instructors clearly explain the goals and purposes .50 
of their courses. (T) 

Enthusiasm Lectures and class discussions are typically vigorous .77 
and intense. (T) 

Humanism Student interest in understanding and criticizing .75 
important works in art, music, and drama is en- 
couraged by the faculty. (T) 

Independence There is little opportunity for pursuing independent 60 
study under the supervision of faculty members. 
(F) 

Pragmatism The academic atmosphere is practical, emphasizing .69 
efficiency and usefulness. (T) 

Supportiveness Some professors here tend to belittle the students. 44 
(F) 

Student scales: 

Achievement Most students here are genuinely interested in 74 
learning and studying. (T) 

Aggression Personal rivalries are fairly common. (T) 59 

Breadth of interests Most students have very little interest in round 57 
tables, panel meetings, or other formal discussions. 
(F) 

Competition The competition for high achievement is intense. (T) 77 
Humanism There is a lot of interest here in poetry, music, paint- 78 
ing, sculpture, architecture, etc. (T) 

Participation Student pep rallies, parades, dances, carnivals or 57 

demonstrations occur very rarely. (F) 
Reflectiveness Books dealing with psychological problems or 76 
personal values are widely read and discussed. (T) 
Scientism When students get together they seldom talk about 76 
science. (F) 
.62 


Social Co 


nformity 


Proper social forms and manners are important here. 


retention rate for boys in the natural sci- 
ences (80.6%) was significantly greater 
(p < 01) than for boys in the biological 
sciences (64.1%) or in the arts and hu- 
manities (68.6%), male enrollment in the 
former fields decreased. The explanation 
for this paradox lies in the failure of the 
natural sciences to recruit new students. 
The natural sciences and the biological 
sciences recruited a significantly smaller 
proportion (p < 01) of their present 


male majors from other fields than did 
the social sciences or arts and humanities. 
For example, the social sciences recruited 
3.12 new males for each boy who with- 
drew, while the natural sciences recruited 
only one new male student for each three 
who withdrew. The retention rate for 
girls in the arts and humanities (75.2%) 
was significantly greater (p < .01) than 
for girls in the natural sciences (60.0%), 
social sciences (59.8%), or biological sci- 
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TABLE 2 


DisTRIBUTIONS OF StupY PREFERENCES OF 987 Boys ANp 513 GIRLs aT 


BEGINNING 
Percentage Majoring 
in Each Field in 
Field of Study 


1956 1959 
Males (N = 987) 

Natural Sciences 62.8 54.6 
Biological Sciences 10.4 8.3 
Social Sciences 10.8 16.3 
Arts and Humanities 8.7 11.8 
Unclassified fields 7.3 9.0 
Total (All fields) 100.0 100.0 

Females (N = 513) | 
Natural Sciences 26.3 20.5 
Biological Sciences 12.3 11.1 
Social Sciences ;} 19.9 22.2 
Arts and Humanities | 27.5 35.5 
Unclassified fields | 14.0 10.7 
Total (All fields) 100.0 100.0 





Remaining in Field, present Majors in 


or CoLLEGE AND THREE YEARS LATER 


Percentage of 


Initial Majors Percentage of | 


Ratio of Recruits 
to Withdrawals 


Field Recruited in Each Field 


——~| from Other Fields | 
(3 years later) 


80.6 7.2 32:1 
64.1 19.5 43:1 
75.5 50.3 3.12:1 
68.6 49.1 2.11:1 
59.7 | 51.7 1.59:1 
75.8 | 24.2 
| 

60.0 | 22.9 44:1 
50.8 | 43.9 81:1 
59.8 | 46.5 1.23:1 
75.2 | 41.8 2.17:1 
50.0 34.6 51:1 
61.6 | 38.4 








ences (50.8%). Like boys, girls found 
the natural sciences least attractive as a 
new field of study: the natural sciences 
recruited a significantly smaller (p < .01) 
proportion of their present female majors 
from other fields than did the biological 
sciences, social sciences or arts and hu- 
manities.” 

Plans to enter graduate or professional 
schools. After three years of college, ap- 
proximately one-third of these students 
are planning to do the three or more years 
of graduate study required for a PhD 
degree, while one in seven expect to get 
a professional degree in medicine or law. 


*Comparisons of the percentages of boys 
maintaining their interests in specific fields 
over three years with those reported by 
Iffert (1957, p. 56) indicate that Merit 
finalists tend to be much more stable in 
their study plans than the typical college 
student. Iffert reports that the percentages 
of boys maintaining their interests in Chem- 
istry, Engineering, Mathematics, and Phys- 
ies are 21, 61, 20, and 29 respectively. For 
boys in the present sample the correspond- 
ing percentages are 66, 94, 39, and 73 


The proportion of students planning to 
get advanced degrees varies considerably 
from one field of study to another. For 
example, 73% of those majoring in the 
biological sciences expect to seek the PhD, 
MD, or similar degree; while 56% of 
natural science majors, 43% of the social 
science majors, and 36% of those majoring 
in arts and humanities aim this high. 

Some of these differences undoubtedly 
ean be attributed to factors other than 
college experiences. Several previous 
studies suggest that fields vary in the pro- 
portions of students who make their ca- 
reer-decisions before entering college 
(Gropper & Fitzpatrick, 1959; Sibley, 
1948; Thielens, 1957). To remove the ef- 
fects of such early decisions from compari- 
sons of the proportions in each field mak- 
ing decisions to seek more advanced 
training, we have excluded all students 
who reported that they planned to seek 
one of the highest degrees (PhD, MD, 
ete.) and that their plans had not changed 
since entering college. These students pre- 
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TABLE 3 


REPORTED CHANGES IN HIGHEST LEVEL OF EpucaTION STUDENTS PLAN TO COMPLETE 











| Percentage Giving Each Response among Students Majoring in Each Field* 


Response to: Has the level of | 





training you aspire to achieve 








changed since you entered | Natural Biological Social | Artsand | phe, Total 
college? | Sciences Sciences Sciences Humanities | ;y = 101) (N = 902 
| (V = 354) | (V = St) | (= 184) | WW = 212) | fo 2 
No. | 29 | 4 | 4 51 2 
Yes. I now plan to seek | a 65 48 | 49 43 48 
a more advanced de- | | 
gree 
Yes. I now plan to seek | ll 6 11 8 6 | 10 
a less advanced degree | 
| | | | 
Totals | 100 100 100 100 100 100 








* Students who, in 1959, planned to seek the PhD, MD, or similar degree and who reported they had not changed 


their plans since entering college are not included. 


sumably had made their decisions before 
entering college and their experiences could 
not result in their aspiring to a higher de- 
gree. 

The remaining students, of which there 
were 902, could either (a) raise their 
level of aspiration or (6) seek a less ad- 
vanced degree. Changes in aspirations, 
classified by major field of study, are 
shown in Table 3 for boys and girls com- 
bined. Approximately half of them re- 
port that they now plan to seek more 
advanced degrees.‘ Sixty-five per cent of 
those majoring in the biological sciences 
have raised their goals, while somewhat 
less than 50% of the students in other 
fields report such changes in plans; this 
difference is statistically significant (p < 
05). 

Before turning to an analysis of college 
experiences related to changes in the di- 
rection and level of career plans, we shall 
consider some of the differences between 
fields with respect to faculty and student 
press. Information about the intellectual 
and social atmospheres in difficult fields 
may help in understanding why many 


“Responses of the entire group of 1500 
students indicate that 62% have not changed, 
31% have raised, and 6% have lowered 
their aspirational levels. (One per cent gave 
no response to this item.) 


talented students change their study 


plans. 


Characteristics of Students and Faculty 
in Specialized Fields 

Student reports of the demands and 
pressures of the faculty in their major field 
and of students in their classroom and 
living groups indicate striking differences 
between the various specialized fields 
Press scales have been grouped in Table 
4 in clusters according to similarity of the 
pattern of deviations of the mean score 
for each field from the grand mean for 
all fields. It can be seen that 16 of the 
18 press scales significantly differentiated 
between fields. In general, the natural sci- 
ences and the arts and humanities are 
most dissimilar in faculty and student 
press. 

Natural sciences. The faculty in these 
fields is characterized by strong press for 
pragmatism, achievement, and _  com- 
pliance, by relatively little enthusiasm, 
affiliation, and supportiveness, and by 
weak press for humanism and independ- 
ence. Student culiures of majors in these 
fields show strong press for scientism, 


competition and aggressiveness, and weak 


press for humanism, reflectiveness, breadth 
of interests, and social conformity. 
Arts and humanities. Faculty and stu- 
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TABLE 4 


Facu.LtTy AND StupENT Press In Four Fietps or Stupy 











Deviation of Mean Score 


of Given Group from Grand Mean | 


| for Each Scale* 














Press Scale F ratio 

Natural Biological Social Arts and 

Sciences Sciences Sciences Humanities | 

Faculty scales: 
Humanism — 166 —33 +96 +102 154.2* 
Independence —74 —s& +49 +33 33.6* 
Enthusiasm —71 —11 +27 +654 21.3* 
Affiliation —40 —10 —6 +54 | 11.5* 
Supportiveness —29 +6 +9 +13 | 5.4* 
Pragmatism +87 +29 —36 —82 | 43 .9* 
Achievement +27 +27 —19 — 36 5.4* 
Compliance +27 +19 —61 +4 | 9.5* 
Directiveness -9 +26 —3 -M | 1.7 

Student scales: 
Humanism —67 | —38 | +68 +47 21.3* 
Reflectiveness —64 | —13 +656 +10 | 12.3* 
Breadth of interests -38 | +10 | +24 | +4 7.9° 
Social conformity | —46 | +4 +4 +36 10.7* 
Scientism | +107 | 415 | -309 | a5 | 55.2" 
Competition +37 | +1 | +16 | —55 | 9.0* 
Aggressiveness +38 | +8 | -17 | —28 | 8.3* 
Participation | +21 +1 | —45 +24 7.3* 

Achievement |} -2 | -8 +24 -15 1.5 





* Grand means calculated weighting each field mean equally. Decimal points in deviation scores have been omitted 
(e.g., .27 is written 27). The highest and lowest deviation on each scale differentiating between fields is printed in 


italics. 


*p < .01. All significance tests were made by two-tailed tests. The number of student observers in each broad 
field, reading from left to right, was 642, 139, 275, and 297, respectively. 


dent press for majors in these fields gen- 
erally present the most extreme contrast 
with the above traits. Teachers tend to be 
enthusiastic, affilitative, and supportive, 
to exert strong press for humanism and 
independence, and relatively weak press 
for pragmatism and achievement. Student 
norms and values are characterized by 
humanism, social conformity and partici- 
pation, and by the absence of scientism, 
competition, and aggressiveness. 

Biological sciences. Both teachers and 
students resemble those in the natural 
sciences though they do not differ as 
markedly from teachers and students in 





other fields as do those in the natural 
sciences. 

Social sciences. Although the faculty 
and students in these fields resemble those 
in the arts and humanities on many scales, 
there are a number of exceptions. Spe- 
cifically, social science faculties are char- 
acterized by weak press for compliance, 
and by relatively little affiliation, while 
the student cultures of social science ma- 
jors emphasize reflectiveness, breadth 
of interests, and competition, placing little 
stress upon participation. Like the arts 
and humanities, the social sciences are as- 
sociated with faculties which score high 
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on humanism, independence and enthusi- 
asm, but low on pragmatism and achieve- 
ment, and with student cultures relatively 
high on humanism but 'ow on scientism. 


College Press and Plans to Seek Advanced 
Training 


Faculty behavior and press. Predictions 
made on the basis of the previous study 
(Thistlethwaite, 1959a, 1959b) are shown 
in the first column of Table 5. It can be 
seen that all 11 predictions were confirmed 
in that the groups predicted to have the 
higher means did in fact. Six of the 11 
differences for which predictions were 
made were statistically significant, as was 
one difference which was not predicted. 
Thus, the following relationships have 
been confirmed in two separate studies: 
college environments characterized by 
faculty affiliation or enthusiasm or faculty 
emphasis upon achievement, humanism, or 
independence are associated with in- 
creased motivation to seek advanced de- 


grees in the arts, humanities, and social 
sciences; college environments character- 
ized by a lack of faculty emphasis upon 
student compliance are associated with in- 
creased motivation to seek advanced de- 
grees in the natural and biological sciences. 
In addition, the present results indicate 
that teachers in the natural and biological 
sciences whom students perceive to be 
enthusiastic tend to be more effective in 
stimulating students to seek more ad- 
vanced training. 

Responses to the direct question, 
“What college experiences have influenced 
the level of training you plan to reach 
(e.g., BA, MA, PhD level) ?” likewise in- 
dicated that college instructors frequently 
influence the student’s decision to seek 
advanced training. Table 6 shows that 
54% of the students who raised their 
level of aspiration mentioned “a few in- 
structors who inspired me to emulate 
them,” compared with only 13% of those 
who lowered their level of aspiration and 


TABLE 5 
PREDICTED AND OBTAINED DIFFERENCES IN FacuLty PREss aS REPORTED BY STUDENT 





SAMPLES WITH DIFFERENT CHANGES IN LEVEL OF ASPIRATION 


Present Major Field 





Natural Sciences and Biological Sciences 


Arts, Humanities, and Social Sciences 





| 

_ 

| 

Faculty Press Scale* 
Mean for Ss 





Increasing Not Chang- 

| Aspiration | 38 Lave 
Achievement (+, +) 6.19 5.89 
Affiliation (+, +) 5.88 5.61 
Compliance (—, ...) 4.71 5.28 
Directiveness (—, ...) 7.24 7.39 
Enthusiasm (..., +) 6.82 6.14 
Humanism (..., +) 5.36 5.22 
Independence (..., +) 6.25 6.10 
Pragmatism (..., —) | 5.93 | 6.16 
a Ln eo 5.46 5.21 


Mean for Ss 


MEAN for Ss 


| Increasing Mean for Ss 


Mean Diff. | Level of /2°t Changing) yfean Dif. 
Aspiration, | (N 66) 
(N = 191)j mi | 
| +.30 | 5.9% | 5.43 | +.50* 
+.27 | 6.68 | 5.99 | +.68*** 
—s7*| 4.20 | 4.2 | —.01 
| —.15 7.27 | 7.07 | +.20 
+.68** 7.87 | 7.32 | +.55* 
| +.14 8.04 | 7.48 | +.56*** 
| +.15 7.72 7.07 | +.66*** 
— .23 4.25 4.48 | —.23 
+.25 6.08 5.68 | +.40 





® The first member of each pair of symbols in parentheses refers to the predicted direction of the differ erence among 
students majoring in the natural or biological sciences; the second member refers to the predicted direction of the 


difference among students majoring in the arts, humanities or social sciences. A dotted line ( 


prediction was made. 
* p < .05 (one-tailed test). 
** p < .05 (two-tailed test). 
*** » < .01 (one-tailed test). 
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TABLE 6 
CoLLEGE EXPERIENCES AND THE LEVEL OF TRAINING STUDENT PLANS TO COMPLETE 


Response to: What college experiences have influenced 
the level of training you plan to reach 
(e.g , BA, MA, PhD level)? 


A few instructors who inspired me to 
emulate them 
Evaluations of instructors which made me | 
think I had potentiality as a contribu- | 
tor to my present field of study 
Challenging and inspiring courses in my 
intended field of study 
Discovering the high ability of students 
with whom I had to compete in my in- 
tended field of study 
Nothing I have experienced here has in- | 
fluenced my educational plans 
A few instructors who alienated me 
Negative evaluations of instructors 
Dull and uninspiring courses in my in- 
tended field of study 


Percentage of Group Giving Each 
esponse 


Ss Who Ss Who Did Difference 


Ss Who tween Cc . 
Decreased | Not Change Inceeneed a ae 
Level of Level of Level of . 
Aspiration Aspiration Aspiration 
(N = 82) (N = 365) (N = 427) 
(1) (2) (3) 
13.4 | 21.9 | 54.1 +40.7** 
21.9 | 24.7 | 51.8 +29.9** 
| | 
17.1 23.3 | 46.4 | +29.3** 
4.4 | 16.7 18.3 —6.1 
11.0 | 38.4 4.2 | -6.8* 
| 
| 
14.6 5.5 4.4 —10.2** 
12.2 . 1.9 —10.3** 
29.3 12.9 18.3 —11.0* 








and because of nonresponse to the question. 

*p< .6. 

"p< Ol. 

22% of those who did not change their 
aspirations. Experiences which reassured 
the student of his abilities in his field of 
study and “challenging and _ inspiring 
courses” were also mentioned much more 
frequently by those who raised their as- 
piration levels. 

Student press. The results of scales 
designed to measure the norms and values 
of the student’s and living 
groups are not shown since none of the 
obtained differences for which predictions 
were made were statistically significant 
and only five of the nine were in the pre- 
dicted direction. There was no evidence 
that student press influenced the aspira- 
tions of Merit Scholars and Certificate of 
Merit winners. 


classroom 


Factors Influencing Choice of Major 
Field 
Identification with a __ stimulating 


teacher. Each student was asked to answer 


® Ns differ slightly from those shown in Table 5 because 101 students classified in other fields are included here 


the question, “What college experiences 
have influenced your choice of career 
field?” by checking on a list-of 12 alterna- 
tives those which applied. Half of the 
alternatives were designed to identify 
deterrents to remaining in an intended 
field of study, and half were designed to 
identify incentives attracting students into 
new fields of study. Among students re- 
cruited into one of the four major fields of 
study the two most popular “incentive 
responses” were: “evaluations of instruc- 
tors which made me think I had potential- 
ity as a contributor in my present field of 
study” (36%) and “a few instructors who 
inspired me to emulate them” (32%). 
The first was checked most frequently by 
students recruited into the biological sci- 
ences (44%) and the arts and humanities 
(40%). It was endorsed least frequently 
by students recruited into the natural sci- 
(29%). The second was endorsed 
most frequently (36%) by students trans- 


ences 
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ferring into the arts and humanities and 
least frequently (24%) by students trans- 
ferring into the biological sciences. 
Additional evidence that identification 
with a stimulating teacher is important 
was obtained by a more indirect question. 
Each student was asked to “list three 
faculty members (either within or outside 
your major department) in order of the 
degree to which you feel they contributed 
to your desire to learn in your chosen fields 
of study.” The departmental affiliations 
of the faculty member listed in first po- 
sition (as most influential) were then 
analyzed and related to changes in the stu- 
dent’s major field of cencentration. As- 
suming that role models partly determine 
the student’s choice of a major field, we 
expected that (a) students who maintain 
their interest in a field would more fre- 
quently name teachers in that field than 
would students who withdraw from the 
field, and (6) recruits into a field would 
more frequently name teachers in that 
field than would students who withdraw 
from the field. Both of these predictions 
were confirmed. Eighty-six percent of the 
stable students, and 73% of the recruits, 
named teachers in the field corresponding 
to their present major field of concentra- 
tion, while only 13% of the students with- 
drawing from a field named teachers from 
that field. These data suggest that failure 
to find a stimulating teacher in one’s initial 
field of concentration is a deterrent to the 
student’s remaining in that field, and that 
he is frequently attracted into new fields 
by exposure to more stimulating teachers. 
Deterrents to continued study in a field. 
The “deterrent responses” to the question 
“What college experiences have influenced 
your choice of career field?”, mentioned 
above, suggest that many of the students 
who drop out of a field do so because their 
expectations of certain career-fields are 
proved to be incompatible with reality. 
Among students who withdrew from one 
of the four major fields, the most popular 
(a) “knowledge I ob- 


responses were: 
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tained in college about my intended field 
of study” (53%), (6) “dull and unin- 
spiring courses in my intended field of 
study” (37%), and (c) “information I 
obtained about opportunities to earn a 
livelihood in my intended field of study” 
(18%). The first two responses were most 
frequent among students withdrawing 
from the natural sciences (63 and 48% re- 
spectively) and least frequent among those 


withdrawing from the arts and human- | 


ities (44 and 18% respectively). Many 
talented students presumably chose their 
career fields without adequate occupa- 
tional information. The results suggest 
that if attempts to arouse the interest 
of students in careers as scientists, en- 
gineers, or teachers do not provide ac- 
curate information about the duties, roles, 
and rewards to persons in these specialized 
fields, such efforts may boomerang. 


Characteristics of the Stimulating Teacher 


It will be recalled that each student 
was asked to list three faculty members 
(from any department) in order of the 
degree to which he felt they contributed 
to his desire to learn. The student was also 
asked to describe on a check list the be- 
havior of the instructor who had the 
greatest influence upon his desire to learn. 
Selected characteristics of the course and 
instructor are listed in Table 7 in the 
order of frequency of mention. Students 
were nearly unanimous in reporting that 
influential teachers exhibited great en- 
thusiasm when discussing their fields. 
Also, over 80% of the students reported 
that these teachers had taken a personal 
interest in them and their work. 

Other traits attributed to these teachers 
varied significantly with the instructor’s 
field of study; these results, like those 
reported in Table 5, suggest that teaching 
techniques effective in one field may not 
necessarily be effective in other fields. For 
example, influential teachers in the natural 
sciences, are distinguished from those in 
the humanities and social sciences by 
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TABLE 7 


CHARACTERISTICS OF TEACHERS AND COURSES WHICH STUDENTS REPORT 
CoNTRIBUTED Most to THetr DestrE TO LEARN 









































iel 
. d Percentage of Teachers Named in Each Field Who 
nin- Exhibited Given Characteristic* 
of Percentage in All 
n I Characteristic of Course or Instructor | Natural Biological | Social Arts and | Pons eee Fields 
| Science Science Science | Humanities ( sss) 
na | Teachers | Teachers | Teachers | Teachers 
: | (N = 593) | (N = 113) | (N = 299) We =, $82) 
dy aM | @ (3) | (s) 
10st 
ing Exhibited great enthusiasm | 96.0 | 96.4 | 95.8 | 96.8 | 96.2 
g when discussing his field 
re- —_ pone interest inmej| 77.0 | 76.8 | 81.9 | 86.8 80.6 
lose apd my work | 
Frequently pointed to | un- | 75.3 | 87.4 | 85.0 | 64.7 75.7 
—_ solved problems on which re- 
any search is needed 
heir Placed the highest priority 79.1 89.2 72.5 | 62.6 74.3 
upon helping the student to 
pa- obtain a broad familiarity | 
rest with knowledge and ideas in 
rest his field 
~ | Always discussed correct an- 87.0 64.8 53.8 | 53.2 69.1 
en- swers to tests 
ac- Course meticulously organized 77.0 72.3 64.9 58.3 69.1 
_ and progressing systemati- 
wes, cally from topic to topic 
zed Was subject to change accord- 58.7 61.3 62.6 71.3 63.1 
ing tc the Ss’ needs and in- 
terests | 
h | Allowed time during class for 51.2 54.1 67.6 78.0 | 62.0 
er | _ extensive student discussion 
ent | Frequently encouraged quali- 62.0 66.8 64.8 57.8 | 61.9 
. —~ core to do graduate 
ETS wor 
the | Provided personal evaluations 49.9 54.1 61.5 76.4 59.7 
4 which made me realize I 
te might become a creative or 
ulso | <* ce worker in my 
. eld 
be Placed the highest priority 46.8 35.1 66.1 72.1 56.6 
the upon developing the stu- 
Wn. dent’s critical abilities 
und Required one or more term pa- 40.6 55.9 65.8 71.1 55.4 
pers or laboratory rojects 
the Usually outlined the day’s lec- 49.7 51.8 49.5 41.2 47.7 
nts ture or discussion at the be- 
hat ginning of the class period | 
. Often tried to relate the course 19.8 45.0 | 68.6 | 72.6 | 46.6 
en- to » beoad human values and | 
s goals 
cs. Meticulously documented 31.9 39.6 | 50.2 41.1 | 39.0 
ted sources of information 
nal Encouraged competition | 45.7 | 29.1 | 31.6 30.0 | 37.1 
among class members 
Made special efforts to give 35.2 31.5 | 34.1 | 4.8 | 37.1 
ers recognition to students who | 
or’s did exceptional work 
= Was primarily a reading course | 6.8 14.3 | 55.9 52.3 30.0 
os¢ | Was frequently critical of the 16.4 | 22.5 | 42.5 | 34.7 27.5 
ing ideas and work of other 
not workers in his field | 
| | | 
‘or 


* In calculating percentages each teacher nomination was treated as if it designated a unique teacher even though 
ral occasionally students attending the same college nominated the same teacher. The Ns for percentages deviate slightly 

in from those given above because some Ss failed to respond to particular checklist items; the average numbers of non- 
: respondents for Columns 1-4 were 20, 2, 8, and 11, respectively. The smallest differences found to be statistically sig- 
by nificant (p < .01) between corresponding percentages in different columns were as follows: Columns 1-2, 12.1; Columns 
1-3, 9.7; Columns 1-4, 9.2; Columns 2-3, 16.7; Columns 2-4, 14.0; Columns 3-4, 9.9. 
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their meticulous organization of courses, 
and by more frequent use of such peda- 
gogical techniques as discussing correct 
answers to tests and encouraging com- 
petition among students. They also require 
term papers and laboratory projects less 
frequently than their colleagues in other 
fields. The influential teacher of biological 
distinguished by 
broad 


sciences seems to be 
stress upon giving the student a 
familiarity with knowledge and ideas in 
his field, and by less emphasis upon de- 
veloping the student’s critical abilities. At 
the same time, like the science 
teacher, he more frequently points to un- 


social 


solved problems on which research is 
needed. 
Students describe stimulating teachers 


and courses in the arts, humanities, and 
social sciences in similar terms: more typi- 
cal of these teachers than of teachers in 
other fields are “reading” courses, attempts 
to relate the course to broad human values 
and life goals, the use of classroom dis- 
cussion and term papers or projects as 
pedagogic techniques, and stress upon 
developing the student’s critical abilities. 
Social science teachers are distinguished 
by their relatively strong emphasis upon 
documentation of sources of information 
and their tendency to be critical of the 
ideas and work of colleagues. Teachers in 
the arts and humanities more frequently 
reassure students of their creative or 
productive potentialities, and make more 
systematic attempts to give recognition to 
students who do exceptional work. The 
traits which distinguish these teachers 
are consistent with the greater press for 
humanism and independence and the 
greater supportiveness of these faculties 
noted in Table 4. 
DISCUSSION 

The greater success of the humanities 
and social sciences in recruiting new stu- 
dents may be attributed to several factors: 
the greater irreversibility of decisions not 
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to major in the natural and biological sci- 
ences (Sibley, 1948); the image of teach- 
ers in the humanities and social sciences 
as being more enthusiastic, more tolerant 
of independence, and pragmatic 
(Table 4); the greater use by the teacher 
of humanities and social sciences of peda- 
gogic techniques which encourage students 
to play the role of a scholar or scientific 
investigator (Table 7). 

Students majoring in the 
sciences more frequently (p < .05) raised 
their educational aspirations while in col- 
lege (Table 3). Although the trait “fre- 
quently encouraged qualified students to 
do graduate work” was more often attrib- 
uted to teachers in the biological sciences 
than to teachers in the arts and humanities 
(Table 7), the results do not suggest any 
simple explanation for this difference in 
“productivity.” 

The failure to replicate previous find- 
ings (Thistlethwaite, 1959a, 1959b) con- 
cerning the effects of student values and 
norms upon motivation to seek advanced 
training may be interpreted in terms of 
the personality characteristics of Merit 
finalists described by Holland (1959, 
1960). Finalists have means on Gough’s Ai 
independence) 


less 


biological 


scale (achievement via 
which place them about two standard de- 
viations above the norm for unselected stu- 
dents (Gough, 1957). Similarly, on Cat- 
tell’s 16 PF scales (1957) finalists differ 
from the average college student in that 
they are more dominant (E), more radical 
(Q,), and more self-sufficient (Q.). Festin- 
ger’s theory predicts that “if persons who 
are very divergent from one’s own opinion 
or ability are perceived as different from 
oneself on attributes consistent with the 
divergence, the tendency to narrow the 
range of comparability becomes stronger” 
(1954, p. 133). A possible explanation is 
that many of these students have indeed 


narrowed the range of individual dif- 


ferences within which they compare them- 
selves to their peers, and are consequently 
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iess susceptible to peer group norms than 
groups less highly select in terms of abil- 
ity. 

Departmental differences in faculty and 
student press have implications for evalu- 
ations of higher educational institutions 
(Knapp & Goodrich, 1952; Knapp & 
Greenbaum, 1953; Traxler, 1957; Thistle- 
thwaite (1959a, 1959b). Table 4 suggests 
that departments differ considerably in 
what they consider to be the objectives of 
the undergraduate curriculum; differences 
in student norms and values suggest the 
output of a department may be a function 
of the characteristics of students it at- 
tracts. If colleges are to be compared on 
the alumni 
obviously necessary to adjust such assess- 
ments so that differences in the propor- 
tions of students in different fields of spe- 
cialization not confounded with 
differences in educational effectiveness. 


basis of achievements it is 


are 


SUMMARY 


This study reports the results of a follow- 
up, at the end of the third year of college, 
of 1500 talented students who had received 
recognition in a national scholarship com- 
petition. Student reports of faculty and 
student press were related to changes in 
the direction and level of career choices. 
The major results may be summarized as 
follows: 

1. The natural sciences were relatively 
successful in retaining talented students 
who initially entered these fields, while the 
biological sciences had less holding power; 
both fields, however, attracted new stu- 
dents less frequently than the arts, human- 
ities, and social sciences. 

2. Experiences in college which encour- 
aged the student’s belief that he may be 
successful in a career-field, provided role 
models for imitation, or proved that his ex- 
pectations about certain career-fields were 
imappropriate, were related to the stability 
and direction of change of study plans. 
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3. Faculties which students perceived as 
enthusiastic, warm and informal in their 
relationships with students (affiliative), 
and as stressing achievement, humanism, 
and independence were associated with 
more frequent changes in students’ plans 
to seek advanced training in the arts, hu- 
manities, 

4. Faculties which students perceive as 
enthusiastic and as great 
press for compliance were associated with 


and social sciences. 


not exercising 
more frequent changes in students’ plans to 
seek advanced training in the natural and 
biological sciences. 

5. Traits attributed to teachers who were 
most influential upon students varied con- 
siderably from field to field. 

Implications for further research on the 
pressures and activities which characterize 
stimulating teachers, disciplines, and col- 
leges are discussed. 
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FACULTY PERCEPTIONS OF STUDENT SUCCESS 
IN TEACHER EDUCATION 


ROGER A. MYERS’ 
Ohio State University 


Successful accomplishment in any com- 
plex social situation is difficult to define. 
When prediction is the goal, this problem 
of definition is especially crucial, as the 
OSS (1948) and Kelly and Fiske (1951) 
studies have demonstrated. In the area of 
teacher education, attempts to predict suc- 
cessful student accomplishment have been 
limited in scope and effectiveness by the 
lack of understanding of what constitutes 
performance that is judged successful. In- 
vestigators who needed acceptable indexes 
for evaluating their predictions have been 
forced to beg the question of definition and 
employ crudely unidimensional criteria, 
such as grade averages, which are assumed 
to represent the complicated concept of 
success. Although this is a widely accepted 
practice, most investigators realize that the 
object of their prediction is not the cri- 
terion measure itself, but rather the stand- 
ard of performance which it represents. 

One approach to the problem of defining 
successful accomplishment is suggested by 
the work of Pepinsky (1956). In his re- 
search on productive behavior he has con- 
ceptualized a social system which is made 
up primarily of two classes of people, ac- 
tors and task-setters. While it is the actor 
whose behavior is studied and predicted, 
the role of the task-setter is a crucial one. 
It is he who decides what requirements are 
made of the actor, specifies the rules for 
meeting them, evaluates the actor’s behav- 
ior, and rewards and punishes him accord- 
ingly. If one were to attempt to define suc- 
cessful accomplishment within such a social 
system, it would appear reasonable to be- 
gin by studying the task-setter in order to 
determine how he defines success. Since he 


, *Presently at the University of North Da- 
ota. 
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is the one who determines success by his 
process of rewarding and punishing, the 
way in which he defines success is of para- 
mount importance. 

In the educational setting, studying the 
faculty member’s views of successful stu- 
dent accomplishment is an infrequent oc- 
currence. One recent exception is the work 
of Stern, Stein, and Bloom (1956). These 
investigators held a series of meetings with 
the faculty of a teacher education program 
during which they determined the things 
the faculty valued and the kinds of opin- 
ions and judgments they made about stu- 
dents. With this information the investi- 
gators constructed a hypothetical model of 
a student who would be regarded as suc- 
cessful. They then examined the students 
in the program, compared them to the 
model, and predicted how the faculty mem- 
bers would evaluate them. Although their 
predictions were extremely successful, the 
time and expense of their method for inves- 
tigating faculty views of success limit is 
usefulness. 

The present study was designed to in- 
vestigate how the faculty of another 
teacher education program define success- 
ful student accomplishment. The specific 
purpose was to learn from the faculty 
members what characteristics of students 
they considered desirable and to construct 
dimensions that would represent the way 
these characteristics were organized in their 
perceptions. An attempt was made to con- 
duct this investigation using research tools 
which are largely clerical and therefore 
more easily reproduced. 


METHOD 


Two-hour interviews were conducted 
with 12 members of a department of edu- 





236 


cation faculty. Each of the faculty mem- 
bers interviewed was, in the opinion of the 
department chairman and other observers, 
a critical leader or representative of a focal 
position in the department. The interviews 
were unstructured except for the inter- 
viewer’s request that each faculty member 
describe, in his own words, the behavior of 
a good student and a poor student in the 
department. The sample size of 12 was de- 
termined when, in the judgment of the in- 
terviewer, redundancy occurred in the in- 
terview and the addition of faculty 
members to the sample produced no new 
descriptive material. 

From typescripts of the interviews the 
investigator extracted 280 statements 
which the faculty members had used in de- 
scribing the good and poor students. Of 
these 280 statements, 169 were finally 
judged to be usable on the basis of clarity, 
objectivity, and general applicability. The 
descriptive statements were then given to 
eight judges who sorted them into 15 mean- 
ingful categories. These categories were 
given the following titles: hostility, flexi- 
bility, intellectual curiosity, general intelli- 
gence, substantive preparation, autonomy, 
rejection, order and detail, nurturance, self- 
confidence, energy and enthusiasm, growth 
potential, sociability, dominance, and de- 
pendence. 

Items from each category were systemat- 
ically spaced throughout a checklist so that 
statements from a single category did not 
confront the reader more frequently than 
once in ten. The order of the pages of the 
checklist was randomized in an attempt to 
control error which might have resulted 
from the position of a statement on the 
checklist. 

Of the 61 members of the department 
faculty who were on duty, 6 had no con- 
tact with the undergraduate students and 
felt that they could not appropriately re- 
spond to the check list. Each of the re- 
maining 55 members was given two copies 
of the check list with instructions to de- 
scribe one good student and one poor stu- 
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dent whom he knew. The students were to 
be in their third or fourth year of college 
and in the teacher education program. The 
instructions for choosing a good student 
suggested, “...one who closely approxi- 
mates the realization of the objectives you 
hold in your teaching and work with stu- 
dents.” Instructions for choosing a poor 
student suggested, “...one who has man- 
aged to remain in college but who has 
not developed in accordance with your 
goals for students.” The descriptions were 
accomplished by assigning a rating on a 
five-point scale to each of the descriptive 
statements, according to the degree to 
which it was characteristic of the student 
being described. Fifty of the 55 faculty 
members returned completed checklists. 

The resulting ratings were factor ana- 
lyzed by the Wherry-Winer method for use 
with large numbers of items (1953). First, 
category scores were obtained by summing 
the ratings of the descriptive statements in 
each category. Then an item analysis was 
performed for each category. As a result of 
these item analyses, two of the categories, 
dominance and dependence, were dropped 
and the items in them were assigned to an 
unclassified pool. The remaining 13 cate- 
gories, used as subtests, were ir: ~rcorre- 
lated. The resulting matrix included some 
correlations that were unusually high and 
suggested that some of the subtests did not 
meet the requirement for linear independ- 
ence imposed by the method. A test for 
linear independence resulted in the drop- 
ping of five more categories and the as- 
signment of more items to the unclassified 
pool. These categories were flexibility, gen- 
eral intelligence, substantive preparation, 
energy and enthusiasm, and growth poten- 
tial. 

The remaining matrix included eight 
categories: hostility, intellectual curiosity, 
autonomy, rejection, order and detail, nur- 
turance, self-confidence, and sociability. A 
multiple-group centroid analysis of this 
matrix yielded a general factor and three 
group factors. From these factors a trans- 
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formation matrix was derived which, when 
multiplied by previously computed corre- 
lations between statements and categories, 
produced orthogonal loadings for each of 
the statements on each of the factors. 


RESULTS 


The four factors obtained were assumed 
to represent dimensions of successful stu- 
dent behavior in the perception of the fac- 
ulty members. Each factor was identified 
by the descriptive statements which had 
loadings on it. Because of the limitation of 
space it is impractical to list all of the 
statements and factor loadings here. Table 
1 includes a sample of these chosen from 
the statements with the highest factor load- 
ings.” 

Factor I is 
most of the descriptive statements have 
high loadings. 
this factor represents a general perceptual 
orientation of the faculty toward accept- 


a general factor on which 


It was hypothesized that 


able student behavior. It appears to be a 
generally shared stereotype of students 
which arises from actual and perceived sim- 
the The 


strength of the factor suggests a high de- 


ilarities among statements. 
gree of commonness among the faculty 
members’ views of student success. In a 
similar study, Stoltz (1959) has pointed 
out the contribution of halo effect to this 
kind of factor. 

The name given to this factor was Gen- 
eral Acceptability. As a dimension of stu- 
dent behavior it is assumed to represent 
some kind of global perception on the part 
of the faculty members. 

Factor II was named Readiness to Learn. 
It contains statements which describe be- 
havior that the judges categorized as in- 
tellectual curiosity, autonomy, self-confi- 
dence, energy, and 


general intelligence, 


*A complete list of the statements and 
their factor loadings is included in R. A 
Myers, “A factor analytic study of faculty 
views of student success.” Unpublished doc- 


toral dissertation, Ohio State University, 
1959, pp. 46-51, 56-58, and 62-64. 
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substantive preparation. It is characterized 
by the student who comes to the learning 
situation with all of his credentials in or- 
der. He has made a habit of doing what is 
expected of him and in addition, when left 
to his own devices, he will use his time and 
energy profitably. He is not only capable 
of dealing with higher level abstractions, 
but iso enjoys going beyond the immediate 
in his search for intellectual stimulation. 
Finally, he is free from neurotic fears which 
might decrease the vigor of his search for 
knowledge and complicate his relationships 
with his professors. 

It is apparent that this dimension is 
highly related to the general capacity var- 
iable that is widely recognized as a major 
determinant of success in any academic 
program. Added to this general capacity, 
however, are strong motivation relevant to 
the goals of the department and personal 
security which permits unencumbered pro- 
fessor-student relationships. 

Since Factor III is composed largely of 
statements that the judges originally clas- 
sified as nurturance and sociability, it was 
given the name of Social Sensitivity. The 
behavior described occurs most frequently 
in the student teaching situation and in the 
student’s daily relationships with his peers. 
It includes such things as initiating and 
directing group action that is facilitative 
to learning; observing, understanding, and 
helping pupils and peers; being tolerant of 
the weaknesses of others; and so on. The 
emergence of this dimension coincides to 
some extent with the findings of Ohlsen 
and Schultz (1954), Tyler (1954), and 
Stern, Stein, and Bloom (1956). 

Factor IV is composed of negatively 
loaded statements from the hostility and 
rejection categories of the judges and posi- 
tively loaded statements from the cate- 
gory called order and detail. The negatively 
loaded statements include behavior that 
suggests the acting out of rebellious feel- 
ings toward the authority figures repre- 
sented by the professors, lack of respect 
for the institution and its goals, and a 
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TABLE 1 
Factor LoaDINGS FOR A SAMPLE OF CHeEcK List STATEMENTS 
Factor 
Statement 
I II Ill IV 

Is confident in examinations 65 66 —06 Os 
Will attend a university sponsored lecture even if it 62 65 —02 06 

is not required 
Has a well-developed vocabulary 70 63 05 05 
Writes papers with coherent conclusions 72 62 06 16 
Can illustrate his abstractions with ease 69 60 04 04 
Will question broad generalities, even outside his 63 59 00 —04 

own area of knowledge 
Can be critical of a professor’s point of view 47 59 01 —20 
Has arrived at his own value system 53 58 02 13 
Has good grammatical habits 70 56 06 09 
Is eager to have his work evaluated 68 56 —06 25 
Has a broad general education 65 55 03 07 
Has the ability to relate to children 66 —18 53 11 
Gets respect from his pupils or peers 72 04 45 29 
Has patience with slow learners 58 —25 44 13 
Can get to the experience level of students he is 70 —11 43 12 

teaching 
Is alert to facial expressions of pupils or peers 62 —06 43 —06 
Says things in class that get the rest of the group in- 68 16 42 —08 

volved 
Can be firm and decisive with pupils or peers 69 07 41 —04 
Has a great deal of warmth in interpersonal rela- 65 00 40 30 

tionships 
Is able to get a group interested in a project 75 03 38 12 
Projects himself into the thinking of his pupils or 68 19 32 

peers 
“Pops off’’ in class —22 08 02 —70 
Wants to do things his way, regardless of the rules —38 08 —05 —70 
Is generally “fed up’’ with the College of Education —47 00 —01 — 66 
Obstructs class progress by thinking up reasons why —37 02 01 —63 

things won’t work 
Thinks he knows more than his professors —36 15 —03 —62 
Expresses negative feelings about course material —23 10 06 —60 
Is punctual 46 38 11 58 
Complains about classes being too large —32 02 16 —57 
Is hostile to his fellow students —53 —18 —01 —652 
Is uncooperative in group projects —41 05 —18 —48 
Turns in required work on time 56 38 18 46 
Shows great loyalty and deference to authorities —14 —07 —09 44 





Note.—All decimals are omitted 





characteristically hostile and uncooperative 
orientation to others. The positively loaded 
statements describe behavior that suggests 
compliance with daily requirements, or- 
derly work habits, and attention to details. 
The combination of these two kinds of 
statements seems to define a dimension 
that has to do with controlling aggressive 


emotions vs. expressing them. The factor 
was given the name of Impulse Control. 


SuMMARY 


Fifty members of a department of edu- 
cation faculty described the behavior of 
good and poor students in a teacher educa- 
tion program using a check list of 169 de- 
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scriptive statements. A Wherry-Winer fac- 
tor analysis of these statements resulted in 
the extraction of four factors which were 
assumed to represent dimensions of student 
behavior as perceived by faculty members. 
The factors have been titled General Ac- 
ceptability, Readiness to Learn, Social 
Sensitivity, and Impulse Control. 

These dimensions help to define the 
standard of performance that the faculty 
of one teacher education program expect 
of their students. Further study could lead 
to the development of ways of evaluating 
students on these dimensions for the pur- 
pose of arriving at a multiple criterion of 
success. 
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CHANGES IN SCORES ON THE NORTHWESTERN 
MISCONCEPTIONS TEST IN SIX ELEMENTARY 
PSYCHOLOGY COURSES 
W. J. McKEACHIE 
University of Michigan 


In 1950 the Division on the Teaching of 
Psychology formed a Committtee on Re- 
search in the Teaching of Psychology’ 
which in its three years of existence spon- 
sored a number of APA programs, sur- 
veyed the need for publication space for 
research in the teaching of psychology, and 
stimulated the writing of an article sur- 
veying research on the teaching of psychol- 
ogy (Birney & McKeachie, 1955). The 
committee also discussed the need for more 
comprehensive achievement tests for which 
normative data might be collected. While 
no agreement on construction of a test was 
reached, the members of the committee did 
decide to administer the Northwestern 
Misconceptions Test (Holley & Buxton, 
1950) to their classes to secure normative 
data for it. The test consists of 100 true- 
false items collected by Buxton from nu- 
merous sources, selected on the basis of 
item-analysis and difficulty. There are an 
equal number of true and false correct re- 
sponses. Newman, Duncan, Bell, and 
Brandt (1952) report a split-half reliabil- 
ity of .71. The test is apparently frequently 
used in general psychology courses, at 
least as part of the introduction to the 
course. 

By giving the test at a number of insti- 
tutions we hoped not only to gather nor- 
mative data on the types and number of 
misconceptions college students hold, but 
also to determine the effect of different in- 
troductory courses upon changing miscon- 
ceptions. 

*Members of the committee were F. K. 
Berrien, Everett Bovard, C. Ray Carpenter, 
Carl P. Duncan, Paul Fields, B. Von Haller 
Gilmer, Wilbert S. Ray, Clarence Young, and 
W. J. McKeachie, Chairman. 


9 


Test Administration 


The test was administered to the ele- 
mentary psychology classes at Carnegie 
Institute of Technology, Colgate, Michi- 
gan, Northwestern, Pennsylvania State, 
and Trinity, half the sections taking the 
test at the beginning of the semester and 
all sections taking it at the end of the 
semester. 


RESULTS 

Pre-Post course gains. Since there were 
no reliable differences between end-test re- 
sults for those who had had the pretest and 
those who had not, the total results will be 
reported for students who took both pre- 
and posttests at each institution. (See Ta- 
ble 1.) 

Although the overall gains at the par- 
ticipating institutions may be of interest to 
other institutions using the test, most psy- 
chologists will be more interested in re- 
sponses to individual items, since these give 
some clues to the beliefs our students have 
and the effect introductory courses are 
likely to have. Such results would be most 
convincing if the introductory courses rep- 
resented here had similar effects. 

Table 2 shows pre-posttest scores for 
selected items. Unfortunately, no item 
showed significant gains at all schools, nor 
did any item show significant decrements 
from pre- to posttest at all schools. Item 85 
did show a loss in responses at three insti- 
tutions, and only one school recorded a pre- 
posttest gain. Item 39 showed significant 
losses at two institutions. Item 25 showed 
a significant gain at one school and a sig- 
nificant loss at another. Only Items 22 and 
65 showed significant pre- to posttest gains 
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at more than one school. When items were 
grouped by content area, such as percep- 
tion, motivation, etc., there were no strik- 
ing differences in correct responses between 
categories. 

Some items in Table 2 are identified as 
representing the four factors isolated by 
Holley and Buxton in an inverted factor 
analysis (Q technique) of 20 students at 
Northwestern. They ran biserial correla- 
tions for passing an item and individual 
factor thus the items 
which most clearly differentiated individ- 
uals high and low on a particular factor. 
These items were then used to identify the 
factor. It is clear from Table 2 that aver- 
age gains for items in Factor I and those 
with positive loading on Factor II are neg- 
ligible. This might be accounted for by a 
preponderance of easy items, since over 
70% of the students passed these items on 
the pretest. However, this is not a suffi- 
cient explanation since 77% of the re- 
sponses to items in Factor III were correct 
on the pretest and significant gains still 
occurred. 


scores, obtaining 


DIscuSSION 


If one of the goals of the elementary 
psychology course is to eliminate miscon- 
ceptions about psychology and the North- 
western test is valid, instructors in none of 
the courses represented in our study can 
be very proud of their accomplishment. 
The typical student who has completed 
one of these courses retains at the end of 
the course about 30 of the 100 misconcep- 
tions, or about 85% of those with which 
he began his course in psychology. Even at 
the end of the course students believe that: 


Item 27. The sense organs of touch, in a 
person with normal vision, are just as sensi- 
tive as those in a blind person, is false. 

Item 53. Rats, cats, and dogs have the 
power to reason, is false. 

Item 72. After graduation from college, 
the average Phi Bete makes more money 
than the average non-Phi Bete, is false. 
Item 74. College men are less promiscuous 
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TABLE 1 


MEAN Scores ON MISCONCEPTIONS TEST 


Institution Pretest |Posttest Gains* N 


Carnegie Tech. | 69.2 


75.6 6.4 96 
Colgate 60.4 | 64.5 4.1 40 
Michigan | 68.8 | 73.7 4.9 57 
Northwestern* | 61.7 | 72.0 | 10.3 | 154 
Pennsylvania 61.1 | 64.8 | 3.7] 113 


State 
Trinity 64.0 | 70.4 6.4 53 


* All gains are statistically significant at the .001 
level. 

® At Northwestern correlation between pre- and post- 
test was .35. 


sexually than men of equal age but lesser 


education, is false. 
Item 91. Psychiatrists are defined as medi- 
cal men who use psychoanalysis, is true 


Validity. Different will 
view the importance of these beliefs dif- 
ferently. In fact, it is quite possible that 
psychologists themselves would not reach 
unanimity on the correct answers to the 
items. To compare student responses with 
those of instructors, the staff of the general 
psychology courses at the University of 
Michigan was asked to rate each item on 
the following scale: 


psychologists 


a. I disagree with the key 

b. I don’t care how my students answer. 

c. I'd be mildly unhappy if my students 
missed this. 

d. I'd be quite unhappy if my students 
missed this. 


The only item which was scored “d” by 
all 11 respondents was Item 26. Only seven 
other items were rated “c” or “d” by all of 
the 11 instructors. A majority of the in- 
structors chose “a” for one item. Of the 24 
items which were passed by less than 60% 
of the students at the end of the course, 
one was the item with the questionable key 
and the rest were items which at least 40% 
of the instructors rated “b”. Thus, there is 
apparently little cause for concern about 
these misconceptions. On the other hand, 
even though most students come into gen- 
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eral psychology with correct beliefs about 
the 8 items regarded as most important by 
the instructors, we have relatively little 
effect on those students who do hold these 
misconceptions. On these 8 items, 80.57% 
of the student responses were correct on 


W. J. McKEACHIE 


the pretest and 82% were correct on the 
posttest. 

What does this add up to? The author’s 
first reaction is one of disillusionment—not 
only with the extent to which our teaching 
generalizes to everyday beliefs, but also 


TABLE 2 
CHANGES IN RESPONSES TO SELECTED ITEMs ALL INSTITUTIONS COMBINED 





(N 





Item Content 


= 513) 








Scored by 
Original Pretest Posttest 
Northwestern % Correct % Correct 
Key 





A. Items reported by Holley and Buxton as having high loadings on their first factor 


53 Rats, cats, and dogs have the power to reason. ar 40 34 

87 The printing on this page is upside-down on your ar 93 93 
retina. 

91* Psychiatrists are defined as medical men who use F 41 43 
psycho-analysis. 

97 There is little that psychology can do for the normal F 93 92 


person. 








2 Cats can see in complete darkness. F 86 92 
47 A dog can sense impending disaster better than a F 47 60 
man. 
71 Half the people in this country are below average in T 54 57 
intelligence. 
76 The unconscious mind is located just above the roof F 83 85 


of the mouth, directly back of the nose. 








C. Items reported by Holley and Buxton as having high loadings on their third factor 


6* A child is born with a sense of good and evil, this is F 85 91 
his conscience. 
i The majority of adult criminals are feeble-minded F 83 86 
or very nearly so. 
37 A person who won’t look you in the eye is probably F 89 93 
untrustworthy. 
63 Being spanked may be pleasurable to a child. T 58 78 








D. Items reported by Holley and Buxton as having high loadings on their fourth factor 


27 The sense organs of touch, in a person with normal T 32 33 
vision, are just as sensitive as those in a blind 


person. 


46 It is probable that man’s instinct to fight is the fun- F 76 87 


damental cause of wars. 


65 The average infant would learn to walk two months F 51 69 
earlier than he does, if he were given proper train- 


ing. 


77 Children memorize much more easily than adults. F 37 44 
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CHANGES IN MISCONCEPTIONS 


TABLE 2.—Continued 


Scored by 


: . Original Pretest Posttest 
Item Content Northwestern % Correct % Correct 
~ 
E. Other items referred to in text 
11* The most important consideration to the industrial F 70 67 
worker is the amount of money a job pays. 
17* Most college students are not susceptible to propa F 87 90 
ganda. 
22 Very bright children tend to be at least equal to T 64 82 


children of average intelligence, with respect to 
physical development. 
25 The very bright child tends to be at least equal to T 56 70 
children of average intelligence, with respect to 
emotional adjustment. 


26* All men are created equal in capacity for achieve- F 83 SS 
ment. 

39 The 1.Q. can be changed, to a significant degree. T 52 49 

62* Worry can cause a truly physical disease. T 87 90 

67* There is no evidence that Negroes are inherently T 86 91 
(naturally) less intelligent than whites. 

72 After graduation from college, the average Phi Bete T 28 32 
makes more money than the average non-Phi 
Bete. 

74 College men are less promiscuous sexually than men T 41 37 
of equal age but lesser education. 

85 A properly-trained psychologist (not a quack) can T 64 42 


tell something about the personality of an indi- 
vidual from his handwriting alone. 
Note.—The data from Northwestern are not included in this table. Numbers passing pre- and posttest at the 


other institutions were simply added to derive this table (N = 359) 
* These items were rated as important by the Michigan staff 


with the validity of the Northwestern test rationalization; for other studies using 
as a measure of these beliefs* However, other items have arrived at similarily de- 
even though one can rationalize the rela- pressing conclusions about application and 
tively small gains by attacking the signifi- generalization of learning in psychology 
cance, wording, or item form of the North- courses (Horrocks & Nagy, 1948; Magaret 
western test, this is not a very satisfying & Sherriffs, 1947). Clearly if we’re really 

Ditties Gs Sains oll fen ee serious about aiming for generalization of 
doubts about the value of a misconceptions © application of psychological perform- 
test of this type, Byron Groesbeck and the nce, we need to investigate more carefully 
author have developed a 20-item type test what types of teaching or content are most 
consisting of 14 items from this test and six effective in producing gains in this area. 
new items. Thirteen of these meet the follow- 


ing criteria: (a) content considered worth SuMMARY 
knowing by at least 70% of the Michigan 
staff, (b) passed by less than 70% of the stu- The Northwestern Misconceptions Test 


de "y on a pretest, (c) item validity (against was administered at the beginning and end 
total score d phi or. r seve . : 
otal score) of 20 or higher. The other seven ¢F the elementary psychology course at six 
were those rated most important by the at oe : 

Michigan staff. Mimeographed copies of the "8ttutions. een alee 
test are available from the author Although all of the institutions produced 
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gains in total test score, gains or losses on 
individual items varied from institution to 
institution. 

Many of the items upon which students 
showed little improvement were regarded 
by faculty members as unimportant. How- 
ever, gains on items rated as important 
were also very small. 
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